Summary of "Собираем информацию с любого сайта в Excel таблицу без разработки: No-Code парсер за 15 минут"
Overview
The video tutorial by Dima Kapranov, a product manager specializing in commerce marketplaces, demonstrates how to extract data from virtually any website into an Excel spreadsheet quickly and without programming. It uses a no-code Chrome extension scraper tool to achieve this.
Key Technological Concepts and Features
-
Manual No-Code Web Scraping: Instead of relying on automatic scrapers that often produce incomplete or low-quality data, the tutorial emphasizes manually creating parsers using a Chrome extension integrated into Chrome DevTools.
-
Selectors: Users define “selectors” to specify what data to scrape, such as product links, titles, prices, descriptions, and reviews. Selectors can be of different types including link, text, image, table, and HTML element.
-
Multi-level Parsing: The scraper can drill down multiple levels (e.g., from product listings to individual product pages, and even seller pages) by creating nested selectors.
-
Pagination Handling: The tool supports pagination through a “paginator” selector that clicks “Next” or “Load More” buttons to scrape data across multiple pages.
-
Advanced Selectors: Includes scroll and click selectors to handle dynamic content loading (e.g., infinite scroll or content revealed after button clicks).
-
Sitemap.xml Parsing: For sites with complex navigation, the scraper can use sitemap.xml files combined with regular expressions to collect links more efficiently without clicking through pages.
-
Configurable Delays: Users can set delays between page loads and link transitions to avoid anti-scraping mechanisms and ensure pages fully load.
Product Features Highlighted
- Integration as a Chrome extension with a dedicated tab in DevTools.
- Ability to preview collected data at each step.
- Export scraped data into Excel or other formats.
- Paid subscriptions unlock auto-updating parsers and additional features, but the free version suffices for quick manual scraping.
- User-friendly interface allowing non-developers to create custom parsers in about 15 minutes.
Tutorial / Guide Structure
- Introduction to the need for web scraping without development.
- Installation and setup of the Chrome extension.
- Step-by-step creation of selectors for links and text fields.
- Explanation of multi-level parsing (from product listings to details).
- Handling pagination with paginator selectors.
- Overview of additional selectors like scroll, click, and sitemap.xml.
- Running the scraper and exporting the data.
- Tips on adjusting delays and handling complex sites.
- Encouragement to ask questions and link to the extension in the description.
Main Speaker
- Dima Kapranov — Product Manager with expertise in commerce marketplaces and no-code scraping solutions.
This video serves as a practical guide for business users and analysts who need to gather competitive or market data from websites quickly without relying on developers or complex coding.
Category
Technology