PulsarRPA: An AI-Enabled, Super-Fast, Thread-Safe Browser Automation Solution! 💖
-
Updated
Jun 7, 2025 - Kotlin
PulsarRPA: An AI-Enabled, Super-Fast, Thread-Safe Browser Automation Solution! 💖
Fully automated and hands-free, accurately extracting and understanding web content — powered by machine learning agents.
Use LLMs to robustly extract structured data from HTML and markdown
基于Scala Akka的分布式主题网络爬虫
Automatic extraction of the information on local event from a webpage with Machine Learning
A powerful and lightweight web scraping library with LLM extraction capabilities. This library combines web scraping with AI-powered content extraction using either OpenAI or OpenRouter APIs.
Predicting product recommendation score using the data available on the website of the client
Programming assignments for Web Information Extraction and Retrieval, FRI UL, 2021. PA1: standalone webcrawler of .gov.si web sites, PA2: approaches of the structured web data extraction, PA3: Data processing and indexing and Data retrieval.
## WebExtractor WebExtractor is a Python tool for OSINT and ethical hacking that extracts email addresses, phone numbers, and links from target websites. It runs on Linux and Termux, providing a simple CLI interface for cybersecurity professionals to gather critical intelligence. 🐙💻
Add a description, image, and links to the web-extraction topic page so that developers can more easily learn about it.
To associate your repository with the web-extraction topic, visit your repo's landing page and select "manage topics."