XY_JSFinder
XY_JSFinder is a Python script designed to extract hyperlinks from specified websites or multiple websites, supporting saving results to local files. It offers basic web crawling functionality along with deep crawling options.
Key Features
- Single Website Crawling: Crawl hyperlinks from a specified single website.
- Multiple Website Crawling: Read multiple website URLs from a file and crawl hyperlinks for each.
- Deep Crawling: Perform in-depth link crawling on a single website.
- Result Saving: Save crawled links to a specified output file.
Usage Examples
-
Single Website Crawling:
Command:python XY_JSFinder.py -u [Website URL] -o [Output File]
Example:python XY_JSFinder.py -u http://example.com -o output.txt
-
Multiple Website Crawling:
Command:python XY_JSFinder.py -f [Input File] -o [Output File]
Example:python XY_JSFinder.py -f input.txt -o output.txt
-
Deep Crawling:
Command:python XY_JSFinder.py -u [Website URL] -o [Output File] -d
Example:python XY_JSFinder.py -u http://example.com -o output.txt -d
Important Notes
- Ensure compliance with legal regulations and website crawling policies.
- Be cautious not to overload servers during deep crawling.
- Install required Python libraries such as
requests
andbeautifulsoup4
.
XY_JSFinder is a powerful tool for web scraping and hyperlink extraction, making it easier to gather data from the web efficiently.