switch database to non-json #31

Open
darealdemayo opened this issue Feb 17, 2024 · 0 comments
darealdemayo commented Feb 17, 2024

Sorry if I missed something or this is a dumb question.

I set up Docker according to the README and it works, both with local CSV and JSON output and with JSON database output.
However, I don't understand how to correctly switch the database to non-JSON mode with separate columns.

When I tried to switch to non-JSON, the scraping just stops working and freezes without an error message after the third page job.
I tried that multiple times and it always freezes after 3 page jobs. I'm assuming you are waiting for 3 results before writing them to the database?

I found and used the SQL files, but those didn't help either. A few things I noticed:

json-up.sql drops the non-JSON fields, but json.down doesn't drop the JSON field. I assumed that the NOT NULL constraint on the JSON column was blocking the script from populating the other columns, but deleting the JSON column didn't help either.

I also noticed that jsondown.sql creates far fewer columns than the datapoints that are actually extracted, so I feel like this SQL file is either faulty or I'm missing something important?

I would have assumed that the database should contain a column for every datapoint that is otherwise extracted as a column in the CSV?
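
To make the mismatch concrete, a check along these lines would list the CSV fields that have no matching table column. This is only a rough sketch: the CSV header and the table column names below are placeholders I made up for illustration, not the project's actual output or schema, and `missing_columns` is a hypothetical helper.

```python
import csv
import io


def missing_columns(csv_header, table_columns):
    """Return CSV fields that have no matching table column (case-insensitive)."""
    existing = {name.lower() for name in table_columns}
    return [field for field in csv_header if field.lower() not in existing]


# Placeholder data: NOT the project's real CSV header or database schema.
sample_csv = "title,address,phone,website\n"
header = next(csv.reader(io.StringIO(sample_csv)))
table_columns = ["id", "title", "address"]

# Fields present in the CSV output but absent from the table.
print(missing_columns(header, table_columns))
```

With the real CSV header and the columns actually created by the SQL file, a non-empty result would confirm that some extracted datapoints have nowhere to go in the table.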
