Commit Graph

16 Commits

Author SHA1 Message Date
Simon Larsen
1a02f5b73e refactor: Update queue_status endpoint response key name
The key name in the response of the queue_status endpoint has been updated from "prnding" to "pending" to fix a typo. This change ensures consistency and improves the readability of the returned data.
2024-06-22 12:43:04 +00:00
Simon Larsen
00b35c4d9a refactor: Convert job function to async in app.py
The job function in app.py has been converted to an async function to support asynchronous processing. This change improves the performance and responsiveness of the application by allowing other tasks to run concurrently while the job function is processing the queue.
2024-06-22 12:31:53 +00:00
Simon Larsen
94956b045a refactor: Improve comments in code and update job function in app.py
This commit refactors the code by improving the comments in the code and updating the job function in app.py. The comments are now focused on code that is hard to understand, and unnecessary comments are removed. The job function is converted to an async function to support asynchronous processing, which improves the performance and responsiveness of the application. These changes enhance the readability and efficiency of the codebase.
2024-06-20 10:00:07 +00:00
Simon Larsen
b2f650a865
refactor: Sanitize file path in CodeRepositoryUtil.getFileContent()
The file path in the CodeRepositoryUtil.getFileContent() method is now sanitized using the LocalFile.sanitizeFilePath() function before executing the command. This change ensures that the file path is properly formatted and prevents any potential security vulnerabilities or errors caused by invalid file paths.
2024-06-20 10:26:16 +01:00
Simon Larsen
9b08d1a9e4 refactor: Convert job function to async in app.py
The job function in app.py has been converted to an async function to support asynchronous processing. This change improves the performance and responsiveness of the application by allowing other tasks to run concurrently while the job function is processing the queue.
2024-06-19 21:05:36 +00:00
Simon Larsen
dbef1071e0 refactor: Add GPU support to Llama app in docker-compose.ai.yml 2024-06-19 20:58:08 +00:00
Simon Larsen
97cc28b182 refactor: Update Dockerfile.tpl to use huggingface/transformers-pytorch-gpu image
This commit updates the Dockerfile.tpl to use the huggingface/transformers-pytorch-gpu image instead of the continuumio/anaconda3 image. This change allows the Llama app to utilize GPU resources for improved performance in AI processing. Additionally, the unnecessary installation of the transformers and accelerate libraries is removed as they are already included in the huggingface/transformers-pytorch-gpu image.
2024-06-19 13:06:23 +00:00
Simon Larsen
df20f343e9
refactor: Update Llama app to log prompt and output to console
This commit updates the Llama app to log the prompt and output to the console for debugging purposes. It adds print statements to log the prompt before processing and the generated output after processing. This change improves the development workflow by providing visibility into the input and output of the AI model.
2024-06-18 22:08:42 +01:00
Simon Larsen
3f315be279
refactor: Update Llama app to use local model path instead of model ID
This commit updates the Llama app to use a local model path instead of a model ID. The model path is set to "/app/Models/Meta-Llama-3-8B-Instruct". This change improves the reliability and performance of the app by directly referencing the model file instead of relying on an external model ID.
2024-06-18 21:41:29 +01:00
Simon Larsen
26bb6f1e74
refactor: Update Dockerfile.tpl to expose port 8547 instead of port 80
This commit modifies the Dockerfile.tpl file to update the EXPOSE directive. The port number is changed from 80 to 8547 to align with the port used by the Llama application. This change ensures that the Llama application is accessible from outside the container on the correct port.
2024-06-18 18:42:11 +01:00
Simon Larsen
b3cfdbf45a
enable gpu on llama docker 2023-10-18 12:07:37 +01:00
Simon Larsen
c629921d01
fix llama docker file. 2023-10-18 11:01:15 +01:00
Simon Larsen
896dce3430
add pycache to gitignore 2023-10-16 20:54:21 +01:00
Simon Larsen
22c2231e22
make llama work with rest api 2023-10-16 11:45:15 +01:00
Simon Larsen
805139055a
make llama work 2023-10-15 21:04:58 +01:00
Simon Larsen
a59742cddb
add python app for llama. 2023-10-15 18:14:15 +01:00