A Python script for scraping data from the GEIPAN (Groupe d'Études et d'Informations sur les Phénomènes Aérospatiaux Non Identifiés). The script extracts information about UFO sightings, including the city, phenomenon, department, case classification, and observation date. The data is saved into a CSV file for further analysis.
- Data Extraction: Scrape data from multiple pages of the GEIPAN website.
- Data Processing: Organize the extracted data into a pandas DataFrame.
- CSV Export: Save the processed data into a CSV file.
- Python 3.x
- BeautifulSoup
- Requests
- pandas
- tkinter (for file dialog)
- Clone the Repository
git clone https://github.com/your-username/geipan-data-scraper.git
cd geipan-data-scraper
- Install Dependencies
Install the required Python libraries using pip
pip install beautifulsoup4 requests pandas
- Run the Script
nsure you have the script GEIPAN_Data_Scraper.ipynb in your directory. Then run the Jupyter notebook or convert it to a Python script and run it
jupyter notebook GEIPAN_Data_Scraper.ipynb
or convert to Python script
jupyter nbconvert --to script GEIPAN_Data_Scraper.ipynb
python GEIPAN_Data_Scraper.py
- Export Data
Follow the prompts to select the location to save the CSV file.
This project is licensed under the MIT License - see the LICENSE file for details.