Web crawler for Burplist, a search engine for craft beers in Singapore

ngshiheng/burplist

Burplist


Context

Welcome to the repository for Burplist's official web crawler, built with Scrapy.

Growing up in a frugal family, I would spend hours browsing online, looking for the best bang for my buck. Needless to say, the process was exhausting and slowly turned into frustration.

So then I thought, why not just create a search engine for craft beers?

Read more...

Disclaimer

This software is intended for research purposes only. Users must abide by the relevant laws and regulations of their location and must not use it for illegal purposes. Users bear all consequences arising from illegal use.

Features

  • 10+ unique spiders for top craft beer sites in Singapore
  • Sentry integration
  • ScrapeOps integration
  • Scraper API for proxy requests
  • Automated random user agent rotation
  • Colored logging
  • Data deduplication pipeline
  • Database migration with Alembic
  • Delayed requests middleware
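
To make the data deduplication feature more concrete, here is a minimal sketch of what such a pipeline could look like. It is not the code used in this repository: the class name, the `url` field used as the deduplication key, and the `DuplicateItem` exception are all assumptions made for illustration. In a real Scrapy project, the pipeline would typically raise `scrapy.exceptions.DropItem` instead of the stand-in exception defined here, which exists only so the sketch runs without Scrapy installed.

```python
class DuplicateItem(Exception):
    """Stand-in for scrapy.exceptions.DropItem (assumption, keeps the sketch dependency-free)."""


class DeduplicationPipeline:
    """Reject items whose `url` was already seen during this crawl.

    The `url` key is a hypothetical choice; any field that uniquely
    identifies a product listing would work the same way.
    """

    def __init__(self):
        # URLs of every item that has passed through the pipeline so far.
        self.seen_urls = set()

    def process_item(self, item, spider=None):
        # Same method name and argument order that Scrapy item pipelines use.
        url = item["url"]
        if url in self.seen_urls:
            raise DuplicateItem(f"Duplicate item found: {url}")
        self.seen_urls.add(url)
        return item
```

The first item with a given `url` is returned unchanged, and any later item with the same `url` raises the exception, which is how a Scrapy pipeline signals that an item should be discarded.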

Requirements

Usage

See this documentation on how to use Burplist.

Contributing

For guidance on setting up a development environment and how to make a contribution, read the contributing guidelines.