WebCrawler

A powerful and extensible C# console web crawler that recursively visits URLs, supports filtering, and exports discovered links to a file.

Screenshots

cd src/WebCrawler
dotnet run

You will be prompted to enter a starting URL.
Optionally, enter filtering criteria:
- Allowed domain (e.g., example.com)
- Allowed extensions (.html, .php, etc.)
- Keywords to include or exclude in URLs
The crawler will process the site and save all valid links to crawled_links.txt.

You can modify filters or concurrency settings inside:

MIT License — use freely, modify boldly.