WebScraper

WebScraper is a Python project for extracting all links from a given website. It features both a command-line interface (CLI) and a modern, dark-themed graphical user interface (GUI) built with Tkinter.

Features

Extracts all links from a specified website
Saves results to a timestamped text file
Dark-themed, user-friendly GUI (Tkinter)
CLI usage for quick scraping
Error handling for invalid URLs and network issues

Requirements

Python 3.7+
See requirements.txt for dependencies:
- requests
- beautifulsoup4
- lxml
- pyfiglet

Installation

Clone the repository:
```
git clone <repo-url>
cd WebScraper
```
Install dependencies:
```
pip install -r requirements.txt
```

Usage

Command-Line Interface (CLI)

Run the scraper from the terminal:

python main.py <site_url>

Example:
```
python main.py example.com
```
The results will be printed to the console and saved in the Results/ folder.

Graphical User Interface (GUI)

Start the GUI:
```
python Modules/gui.py
```
Enter the website URL (e.g., https://example.com) in the input field.
Click the "Scrape" button or press Enter to start scraping.
All found links will be displayed in the results area.
Click "Save Results" to export the links to a file in the Results/ folder.

ScreenShots

This is the initial look of the application when you first open it.

After entering a URL and clicking 'Scrape', the application scans the website for links.

All found links are displayed in the results area, ready to be saved.

Folder Structure

WebScraper/
├── main.py              # CLI entry point
├── requirements.txt     # Python dependencies
├── README.md            # Project documentation
├── Results/             # Scraped results are saved here
├── Modules/
│   ├── scraper.py       # Scraping logic
│   ├── greeting.py      # Welcome message logic
│   └── gui.py           # Tkinter GUI

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Img		Img
Modules		Modules
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

WebScraper

Features

Requirements

Installation

Usage

Command-Line Interface (CLI)

Graphical User Interface (GUI)

ScreenShots

Folder Structure

About

Uh oh!

Releases

Packages

Languages

MrShiroLu/WebScraper

Folders and files

Latest commit

History

Repository files navigation

WebScraper

Features

Requirements

Installation

Usage

Command-Line Interface (CLI)

Graphical User Interface (GUI)

ScreenShots

Folder Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages