Skip to content

Test work on using node to scrape blog from a jimdo site

Notifications You must be signed in to change notification settings

reformatco/jimdo-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Jimdo scraper using Node.js

Initial experiment using cheerio and sanitize-html to scrape a Jimdo blog enabling you to export content to another CMS.

To do

  • scrape pagination
  • scrape each pages within the blog
  • save to json file of all blog posts with links, date, title etc
  • iterate through each blog post saving data into
    • single json file
    • separate json files

into wp

  • create post with title
  • make first image element the featured image
  • import

About

Test work on using node to scrape blog from a jimdo site

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published