You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
simonbrunel opened this issue
Jan 31, 2024
· 0 comments
Labels
featureIssues that represent new features or improvements to existing features.t-toolingIssues with this label are in the ownership of the tooling team.
Which package is the feature request for? If unsure which one to select, leave blank
@crawlee/utils
Feature
Loading sitemaps using Sitemap.load() should give access to the other tags defined by the Sitemaps format: loc, lastmod, changefreq and priority.
Motivation
Sitemaps give information about when each page has been last modified, priority, etc... and while I'm sure there are other libraries to load sitemaps, it's easier to rely on the Crawlee utils instead (consistency and less dependencies). Since you already provide a Sitemap util, it should be relatively easy to expose other tags other than the url.
Ideal solution or implementation, and any additional constraints
featureIssues that represent new features or improvements to existing features.t-toolingIssues with this label are in the ownership of the tooling team.
Which package is the feature request for? If unsure which one to select, leave blank
@crawlee/utils
Feature
Loading sitemaps using
Sitemap.load()
should give access to the other tags defined by the Sitemaps format:loc
,lastmod
,changefreq
andpriority
.Motivation
Sitemaps give information about when each page has been last modified, priority, etc... and while I'm sure there are other libraries to load sitemaps, it's easier to rely on the Crawlee utils instead (consistency and less dependencies). Since you already provide a
Sitemap
util, it should be relatively easy to expose other tags other than theurl
.Ideal solution or implementation, and any additional constraints
Alternative solutions or implementations
No response
Other context
No response
The text was updated successfully, but these errors were encountered: