mirror of
https://github.com/OneUptime/oneuptime
synced 2024-11-21 22:59:07 +00:00
c099f3a3ef
The error handling in app.py has been improved to catch and handle exceptions that occur during the processing of items in the queue. This change ensures that errors are properly logged and that the affected items are removed from the pending items list. |
||
---|---|---|
.. | ||
Models | ||
app.py | ||
Dockerfile.tpl | ||
Readme.md | ||
requirements.txt | ||
tsconfig.json |
Llama
Prepare
- Download models from meta
- Once the model is downloaded, place them in the
Llama/Models
folder. Please make sure you also place tokenizer.model and tokenizer_checklist.chk in the same folder. - Edit
Dockerfile
to include the model name in theMODEL_NAME
variable. - Docker build
docker build -t llama . -f ./Llama/Dockerfile
Run
For Linux
docker run --gpus all -p 8547:8547 -it -v ./Llama/Models:/app/Models llama
For MacOS
docker run -p 8547:8547 -it -v ./Llama/Models:/app/Models llama
Run without a docker conatiner
uvicorn app:app --host 0.0.0.0 --port 8547