I found a flaw: "Bot is cooking for too long." #139
Comments
This is the same issue I've encountered as well. I've opened a ticket to report this problem before. I believe it won't be long before it gets fixed. Let's wait for the next update... |
Which data source is causing the issue, the docx or the pdf? I know this issue occurs on Railway and works fine locally. I will look into a solution. |
Both |
I have updated the railway template, which may fix the file processing issues. |
@n4ze3m I waited 3 hours and still got the same problem. Nothing has changed. The problem has not been completely resolved. My file has approximately 500-1000 pages. How many pages did you test the document for? Please try 500-1000 or more pages and you will encounter this problem. |
Is this issue related to Railway or is it local? For Railway, I think you need to reinstall the Railway template. The old one doesn't have a Docker mount, which may be causing the issue. |
Railway. I tried it, and your latest version is 1.4.1. |
Hello, can you reinstall your railway app? The latest update has mounted an upload folder, preventing the deletion of uploaded files. Railway template: https://railway.app/template/TXdjD7 I have tested a 758-page PDF, approximately 17 MB, using Cohere embedding, and it's working without any issue. PDF I tested: https://www.microsoft.com/en-us/research/uploads/prod/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf |
Please carefully watch the video. Do not fast forward or skip, as there are explanations that you need to read. https://streamable.com/afx0tc I have tested on "Railway" again, and it seems that I am still encountering the same issues. Here are my observations:
|
I will look into it. I think the issue is with the DOCX loader.
I will test with the OpenAI API, as I think the issue may be caused by a rate limit. I will look into it. Currently, you cannot delete a data source while it is processing. I will update the error label. |
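For readers following the rate-limit hypothesis above, here is a minimal sketch of how embedding calls for a large document could be batched and retried with exponential backoff. It is not the project's actual code: `RateLimitError`, `embed_with_backoff`, and the `embed_batch` callable are hypothetical names chosen for illustration, and the batch size and delays are arbitrary assumptions.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for an embedding provider's rate-limit error."""


def embed_with_backoff(embed_batch, chunks, batch_size=64, max_retries=5,
                       base_delay=1.0, sleep=time.sleep):
    """Embed text chunks in batches, retrying a batch when it is rate limited.

    embed_batch is an assumed callable: it takes a list of strings and
    returns one vector per string, raising RateLimitError when throttled.
    """
    vectors = []
    for start in range(0, len(chunks), batch_size):
        batch = chunks[start:start + batch_size]
        for attempt in range(max_retries + 1):
            try:
                vectors.extend(embed_batch(batch))
                break  # this batch succeeded, move on to the next one
            except RateLimitError:
                if attempt == max_retries:
                    raise  # give up so the caller can mark the source as failed
                # Exponential backoff plus jitter: about 1s, 2s, 4s, ...
                delay = base_delay * (2 ** attempt) + random.uniform(0, base_delay)
                sleep(delay)
    return vectors
```

Raising after the last retry, rather than looping forever, lets the caller surface a failed state instead of leaving the data source stuck in processing.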
Hello, I have released a new update which addresses the issue with the docx loader. This update has been tested on a 700+ page docx document on Railway using the text-embedding-ada-002 model. The processing time for the file is approximately 2-3 minutes. Tested docx link: https://docs.google.com/document/d/18-ETRBO4yRpRl3nF68P8vTbunlBgdy_t/edit?usp=sharing&ouid=108531690400573042017&rtpof=true&sd=true demo.mp4 |
@n4ze3m |
:| same docs ?? |
I'm sorry, I don't fully understand what's happening. If you are using Railway, I highly recommend deleting the existing application and creating a new one from the latest template. I have tested it on a new Railway application. |
Is it necessary to delete the database on Supabase? I've tried reinstalling the app excluding Supabase and reinstalling it from scratch, but it still doesn't work. Do I need to delete the database to start fresh? |
No. Make sure your database has enough space; embeddings take up a lot of space. I just tested the application on Railway, and it works perfectly for me. Here is the uncut version: brave_Bf0jqbXYDB.mp4 |
@n4ze3m |
@n4ze3m |
@yoobaring: I've run into the same issue multiple times while testing on Railway and similar services. While everything was working fine in my local environment, there was this issue with large files on cloud services. In the end it was a simple issue of scaling. Just ensure that your runtime environment has at least 4 GB of RAM and 4 dedicated CPUs. To fix the issue temporarily, simply go into the database table which contains the latest file references, and remove the one on which your bot is hanging. |
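The temporary workaround described above (removing the stuck file reference from the database) could look roughly like the sketch below. Everything schema-related here is an assumption: the table name `data_source`, the `status` column, and the `'processing'` value are hypothetical, and `sqlite3` is used only as a self-contained stand-in for the hosted database mentioned in the thread. Check the real schema and back up the database before deleting anything.

```python
import sqlite3


def remove_stuck_sources(conn, table="data_source", stuck_status="processing"):
    """Delete rows still marked as processing and return how many were removed.

    The table and column names are hypothetical; adapt them to the real schema.
    """
    # Table names cannot be bound as SQL parameters, so validate the name first.
    if not table.isidentifier():
        raise ValueError(f"unsafe table name: {table!r}")
    cur = conn.execute(f"DELETE FROM {table} WHERE status = ?", (stuck_status,))
    conn.commit()
    return cur.rowcount


# Usage with an in-memory database standing in for the real one.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE data_source (id INTEGER PRIMARY KEY, name TEXT, status TEXT)")
conn.executemany(
    "INSERT INTO data_source (name, status) VALUES (?, ?)",
    [("small.pdf", "finished"), ("large.docx", "processing")],
)
removed = remove_stuck_sources(conn)
remaining = [row[0] for row in conn.execute("SELECT name FROM data_source")]
```

This only clears the hanging entry; as the comment above notes, the underlying cause may still be insufficient memory or CPU in the runtime environment.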
Hi @n4ze3m, I have encountered an issue. The bot is cooking for too long. I've found this problem in documents with many pages, but for a small number of pages, I haven't encountered this issue. I hope this issue will be resolved. Please note that I'm running tests on Railway.