Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error 1: Likes: Error parsing post #374

Open
nateify opened this issue Oct 29, 2022 · 21 comments
Open

Error 1: Likes: Error parsing post #374

nateify opened this issue Oct 29, 2022 · 21 comments

Comments

@nateify
Copy link

nateify commented Oct 29, 2022

Describe the bug
Error 1 parsing when downloading likes

To Reproduce
I am downloading my likes directly by the URL tumblr.com/likes while logged in, accepted privacy statement, using Tumblr SVC API. It gets most of the way through but fails at a certain point.

Screenshots
Some screenshots of the setup:
https://files.catbox.moe/krzeos.png
https://files.catbox.moe/g8pbwx.png

Log file right after the error occurred: https://pastebin.com/raw/jYUbmqGG

Desktop (please complete the following information):

  • TumblThree version: 2.8.3.357
  • OS: Windows 10
@thomas694
Copy link
Contributor

Thanks for reporting a bug. We'll take a look at it.

thomas694 added a commit that referenced this issue Nov 1, 2022
- There was an error that prevented the parsing of reblogged text posts with no added content.
- Now normal and reblogged text posts can be downloaded.
@thomas694
Copy link
Contributor

The issue has been fixed and closed. You can still comment. Feel free to ask for reopening the issue if needed.

@nateify
Copy link
Author

nateify commented Nov 1, 2022

Hi, thanks for the update. I am running into the same Error 1 parsing though it stopped on a different post and with a new log:
https://pastebin.com/raw/RizkyUvM

@thomas694
Copy link
Contributor

That wasn't intentional.
Your screenshots and log excerpts always help to get directly to the problem location. 👍🏻 Better than a statement "all my blogs fail", even if you doubt it, because there are successful tests with some sample blogs before a new version is released.

Could it be that now a different kind of text posts like quotes, question and answers fails? Are they just reblogs or was new content added?
If you don't mind, you could provide (through our feedback dialog or my profile email) links to failing posts, it would help even more.

@thomas694 thomas694 reopened this Nov 1, 2022
@nateify
Copy link
Author

nateify commented Nov 1, 2022

That wasn't intentional. Your screenshots and log excerpts always help to get directly to the problem location. 👍🏻 Better than a statement "all my blogs fail", even if you doubt it, because there are successful tests with some sample blogs before a new version is released.

Could it be that now a different kind of text posts like quotes, question and answers fails? Are they just reblogs or was new content added? If you don't mind, you could provide (through our feedback dialog or my profile email) links to failing posts, it would help even more.

Where can I find the post it fails on?

@thomas694
Copy link
Contributor

It's not directly logging which posts or urls fail. But maybe if the errors already start directly after pressing "download", you can see which are missing by comparing it to the version you see in your browser. Or you could add the url as a new blog, or use a new "download location" for testing.

@nyckidryan
Copy link

nyckidryan commented Nov 4, 2022

Same issue downloading via http://www.tumblr.com/liked/by/[blog]

Downloaded items: 46
Newly downloaded items: 46
Number of downloadable items: 48
(actual blog likes: over 59,000)

Downloading images only, all other download options turned off, stock config/fresh installation:

tumblthree-config

Error 1: [blog]: Error parsing post!

From TumblThree.log (newlines added for clarity):

20221104 03:36:19.216 Inf TumblThree start
20221104 03:36:19.217 Inf AppPath: J:\TumblThree-v2.9.0
20221104 03:36:19.217 Inf AppSettingsPath: D:\Users\ryan\AppData\Local\TumblThree\Settings
20221104 03:36:19.217 Inf LogFilename: J:\TumblThree-v2.9.0\TumblThree.log
20221104 03:36:19.217 Inf Version: 2.9.0.363
20221104 03:36:19.218 Inf IsLongPathSupported: True

20221104 03:36:45.576 Err TumblrLikedByCrawler:CrawlPageAsync: System.Exception: 429 - Limit exceeded.
at TumblThree.Applications.Crawler.AbstractTumblrCrawler.RetrieveOriginalImageUrl(String url, Int32 width, Int32 height, Boolean isInline) in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\AbstractTumblrCrawler.cs:line 446
at TumblThree.Applications.Crawler.AbstractTumblrCrawler.AddTumblrPhotoUrl(String text, Post post) in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\AbstractTumblrCrawler.cs:line 206
at TumblThree.Applications.Crawler.TumblrLikedByCrawler.AddPhotoUrlToDownloadList(String document) in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\TumblrLikedByCrawler.cs:line 555
at TumblThree.Applications.Crawler.TumblrLikedByCrawler.d__26.MoveNext() in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\TumblrLikedByCrawler.cs:line 524
--- End of stack trace from previous location where exception was thrown ---
at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
at TumblThree.Applications.Crawler.TumblrLikedByCrawler.d__16.MoveNext() in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\TumblrLikedByCrawler.cs:line 208

20221104 03:36:48.662 Err TumblrLikedByCrawler:CrawlPageAsync: System.Exception: 429 - Limit exceeded.
at TumblThree.Applications.Crawler.AbstractTumblrCrawler.RetrieveOriginalImageUrl(String url, Int32 width, Int32 height, Boolean isInline) in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\AbstractTumblrCrawler.cs:line 446
at TumblThree.Applications.Crawler.AbstractTumblrCrawler.AddTumblrPhotoUrl(String text, Post post) in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\AbstractTumblrCrawler.cs:line 206
at TumblThree.Applications.Crawler.TumblrLikedByCrawler.AddPhotoUrlToDownloadList(String document) in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\TumblrLikedByCrawler.cs:line 555
at TumblThree.Applications.Crawler.TumblrLikedByCrawler.d__26.MoveNext() in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\TumblrLikedByCrawler.cs:line 524
--- End of stack trace from previous location where exception was thrown ---
at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
at TumblThree.Applications.Crawler.TumblrLikedByCrawler.d__16.MoveNext() in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\TumblrLikedByCrawler.cs:line 208

20221104 03:36:50.586 Err TumblrLikedByCrawler:CrawlPageAsync: System.Exception: 429 - Limit exceeded.
at TumblThree.Applications.Crawler.AbstractTumblrCrawler.RetrieveOriginalImageUrl(String url, Int32 width, Int32 height, Boolean isInline) in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\AbstractTumblrCrawler.cs:line 446
at TumblThree.Applications.Crawler.AbstractTumblrCrawler.AddTumblrPhotoUrl(String text, Post post) in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\AbstractTumblrCrawler.cs:line 206
at TumblThree.Applications.Crawler.TumblrLikedByCrawler.AddPhotoUrlToDownloadList(String document) in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\TumblrLikedByCrawler.cs:line 555
at TumblThree.Applications.Crawler.TumblrLikedByCrawler.d__26.MoveNext() in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\TumblrLikedByCrawler.cs:line 524
--- End of stack trace from previous location where exception was thrown ---
at System.Runtime.ExceptionServices.ExceptionDispatchInfo.Throw()
at TumblThree.Applications.Crawler.TumblrLikedByCrawler.d__16.MoveNext() in C:\projects\Tumblthree\src\TumblThree\TumblThree.Applications\Crawler\TumblrLikedByCrawler.cs:line 208

@thomas694
Copy link
Contributor

Same issue downloading ...

Not same issue, maybe similar error message shown, but actually different reason.
Here it's a "429 limit exceeded" error. Tumblr says you downloaded too much in the measured time frame. You probably did and maybe are using more than one instance (web browser, app) to access their site at once.

We'll change how this error message is displayed.

thomas694 added a commit that referenced this issue Nov 14, 2022
@IThatAngel
Copy link

103 days since the last comment on this, I'm having the same issue here, and it's not just affecting my likes link. I tested it on another blog and the issue was there too. Most of the errors I got from them were from parsing posts (1).
After it downloads a couple tens or hundreds of posts (I noticed the size depends on the size of posts of a clog), it stops after a certain point.
Right-clicking a selected process via "enqueue selection" only scans the number of downloadable items and increases it, without downloading anything. (This mainly happens in my liked selection).
The download process is practically stuck. No changes in the downloaded items tab, nothing, nadah.

Likes
TumblThree_oX0XNu9vQd
TumblThree_JUXETQeEZR
https://www.tumblr.com/likes
(The number of liked posts in Tumblr itself was 5,5k)

The solo blog in question.
TumblThree_KvVOXGSFim
TumblThree_RqZWFhw2mg
https://floatingchromeorb.tumblr.com/archive

Tried switching crawler detection, Still nothing.

Last thing I'd like to mention, I tried tinkering in the settings, switching some log setting to "error" fetched me this.

TumblThree_4PGdd6Y5uD

I switched it off to try something else, later I tried switching this back on and error 2 never showed up for me again.

  • TumblThree version: 2.10.0.384
  • OS: Windows 10 Home

I'm not a deeply technical person, since I couldn't find any log file that was mentioned earlier, I hope you understand.

@thomas694
Copy link
Contributor

@IThatAngel

You can find the log file here (%LOCALAPPDATA%\TumblThree\TumblThree.log), you can copy & paste it in windows file explorer.

Error 2 was a temporary error, something wasn't downloadable at that time. If the error is gone it's ok.

The mentioned blog is a small one. Comparing the downloaded items against the one in the browser it showed that all has been downloaded, only the number of 15 downloaded images isn't added to the numbers for some reason.
We'll have a look.

@IThatAngel
Copy link

Found the log in question. (user folder name will be replaced in here with █ █ █ for privacy concerns, sorry about that)

20230309 13:46:09.372 Inf TumblThree start
20230309 13:46:09.373 Inf AppPath: G:\Users\█ █ █\Documents\TumblThree-v2.10.0-x64-Application
20230309 13:46:09.373 Inf AppSettingsPath: G:\Users\█ █ █\Documents\TumblThree-v2.10.0-x64-Application
20230309 13:46:09.373 Inf LogFilename: G:\Users\█ █ █\Documents\TumblThree-v2.10.0-x64-Application\TumblThree.log
20230309 13:46:09.373 Inf Version: 2.10.0.384
20230309 13:46:09.373 Inf IsLongPathSupported: True

20230309 13:48:50.541 Inf TumblThree start
20230309 13:48:50.542 Inf AppPath: G:\Users\█ █ █\Documents\TumblThree-v2.10.0-x64-Application
20230309 13:48:50.542 Inf AppSettingsPath: G:\Users\█ █ █\Documents\TumblThree-v2.10.0-x64-Application
20230309 13:48:50.542 Inf LogFilename: G:\Users\█ █ █\Documents\TumblThree-v2.10.0-x64-Application\TumblThree.log
20230309 13:48:50.543 Inf Version: 2.10.0.384
20230309 13:48:50.543 Inf IsLongPathSupported: True

@thomas694
Copy link
Contributor

Sure, you can mask personal details.
But as you may have seen both blocks contain the same information except the timestamps. It's the current version running in portable mode.
A part with an error message would help.

@thomas694
Copy link
Contributor

@IThatAngel
Please try the newest version.

@IThatAngel
Copy link

Sorry for the late response, I only got notified by this via email. I downloaded the latest version as you asked.
So far I had no issue downloading smaller blogs, however when I attempted to download liked posts again, it failed to detect an account... which I had already authenticated.
I tried uninstalling and reinstalling it +authenticating again. And I still get the same error with the liked posts.

That's all I think of to mention at this moment.

@thomas694
Copy link
Contributor

Please try the new test version and check if the authentication works better for you.
You can conveniently extract the zip file into a new folder, no need to remove your normal installation folder for testing.

@IThatAngel
Copy link

Please try the new test version and check if the authentication works better for you. You can conveniently extract the zip file into a new folder, no need to remove your normal installation folder for testing.

I downloaded the version you provided. Sadly, the authentication issue still persists. I even logged in before trying to download likes again.
TumblThree_4wVijmNUAL
TumblThree_MwIdqEwKr6
On the bright side, at least I can download other blogs with no issues now,

@azizomar9
Copy link

@thomas694 I am having the same issue as @IThatAngel was having as well. I can't download my likes anymore because it says I am not logged in when I am. Is there a solution to the issue yet?

@thomas694
Copy link
Contributor

Please try the newest test version and let us know the results.

@IThatAngel
Copy link

Please try the newest test version and let us know the results.

Aaaand nothing has changed...
Still can't download my likes, but can for other blogs.
Even deleted previous folders before running this one and authenticating.
Still nadah. Oh well.

thomas694 added a commit that referenced this issue Jun 20, 2023
- The crawler didn't succeed any longer with the logged-in check.
@joaobarros
Copy link

I had the same not logged in error and could only get it to work when I changed to https://www.tumblr.com/liked/by/[blog name] format but I see it was already tried that way.
The normal like format should work or the /liked/by/ documented as it can get quite frustrating to get it going.

@thomas694
Copy link
Contributor

@ all
Please try the newest test version and let us know the results. Before doing the test download of your Likes, delete/rename the old log file, set the Log Level to Verbose and ensure you are logged in.

Does it fail to download posts from the beginning or after a while?
Please keep the last 1-3 pages from the log file and send the zipped file and any useful info by email, or upload it to e.g. Pastebin and send this link by email or feedback dialog.

thomas694 added a commit that referenced this issue Aug 26, 2023
…tinue

- Occasionally something happens to the cookies and Tumblr returns an auth error on the next request.
- In case of an auth error the cookies are fetched from the WebView instance again and the request is repeated.
- It's not totally clear yet why this happens.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants