content-scraper

Multi Scrapper AI is an all-in-one content intelligence tool built with Streamlit that extracts, summarizes, and analyzes YouTube videos, PDFs, and websites. It uses Gemini 2.0 Flash and Ollama models to generate structured summaries, answer questions, and convert long content into actionable insights.

python nlp web-scraping data-extraction beautifulsoup pypdf2 pdf-parsing gemini-api content-scraper streamlit youtube-transcript ai-tools llm ollama

Updated Dec 1, 2025
Python

REDFOX1899 / content-scraper

Star

🚀 A powerful, extensible content scraping system for collecting authentic content from public figures across Twitter, YouTube, blogs, podcasts, and books. Built with AI-powered processing, authenticity validation, and vector embeddings.

python nlp machine-learning youtube twitter ai scraping embeddings web-scraping data-collection balaji content-scraper vector-database tim-ferriss

Updated Nov 5, 2025
Python

mdazlaanzubair / chrome-ext-scrapy

Star

Data Extractor is a Chrome extension that enables users to extract and log data from webpages with ease. Capture complete webpage content and save it directly to a text file. Simple setup and real-time feedback make data extraction straightforward and efficient.

javascript chrome-extension html chrome script webscraping scraping-websites content-scraper

Updated Aug 30, 2024
JavaScript

rishijha / sitemap-harvester

Star

🗺️ Harvest URLs and metadata from website sitemaps efficiently with this fast Python tool. Get organized insights for your digital projects.

python sitemap automation seo open-graph web-crawler web-scraping robots-txt data-extraction xml-parser meta-tags metadata-extraction cli-tool seo-tools content-scraper sitemap-parser website-analysis url-harvester

Updated Jan 2, 2026

Improve this page

Add a description, image, and links to the content-scraper topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the content-scraper topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

content-scraper

Here are 8 public repositories matching this topic...

meysam81 / sitemap-harvester

nghianguyen09 / spa-content-extractor-using-webview2

TJaySteno / P06-content-scraper

grashupfer99 / js-techdegree-project-6

A4xPraddy / MultiScrapperGenAi

REDFOX1899 / content-scraper

mdazlaanzubair / chrome-ext-scrapy

rishijha / sitemap-harvester

Improve this page

Add this topic to your repo