Features
- Key features: counting and comparing HTML elements across SERP results, tracking the frequency of specific keywords in those results, displaying the position of each web page in the SERP, and evaluating the importance of HTML tags and keyword usage.
- SEO Webpage Comparator: A tool for comparing the HTML elements and SEO parameters of websites, which is central to search engine positioning.
- Google SERP Analysis: Automatically fetches and analyzes Google's Search Engine Results Pages (SERP) based on user-input keywords.
- Advanced Data Extraction: Identifies special content types in the results, including Wikipedia entries and Google Maps content.
- Comprehensive SEO Parameters Analysis: Analyzes selected SEO parameters of webpages.
- Real-time Data Processing: Processes and compares data in real time, providing immediate insights.
- Customizable Data Filters: Offers users the ability to customize search parameters for more targeted analysis.
- User-Friendly Interface: Intuitive and interactive user interface for seamless operation and user experience.
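As a rough illustration of the HTML element counting listed above, the sketch below counts opening tags of a given name in an HTML string. It is not the project's actual code: `count_opening_tags` is a hypothetical helper, and a plain string scan stands in for whatever parsing the program really performs.

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical helper (not taken from the project): counts opening tags
// such as <h1> or <h1 class="x"> in an HTML string.
// The tag name is compared case-insensitively for ASCII characters only.
std::size_t count_opening_tags(const std::string& html, const std::string& tag) {
    std::size_t count = 0;
    for (std::size_t pos = html.find('<'); pos != std::string::npos;
         pos = html.find('<', pos + 1)) {
        std::size_t i = pos + 1;  // index into html, just after '<'
        std::size_t j = 0;        // index into tag
        while (j < tag.size() && i < html.size() &&
               std::tolower(static_cast<unsigned char>(html[i])) ==
                   std::tolower(static_cast<unsigned char>(tag[j]))) {
            ++i;
            ++j;
        }
        // Skip if the name did not match fully or the input ended.
        if (j != tag.size() || i >= html.size()) continue;
        // The name must end here, so "p" does not match "<pre>".
        const char next = html[i];
        if (next == '>' || next == '/' ||
            std::isspace(static_cast<unsigned char>(next))) {
            ++count;
        }
    }
    return count;
}
```

Calling `count_opening_tags(page, "h1")` for each downloaded page would give per-page counts that can then be compared across the SERP. Closing tags such as `</h1>` are not counted.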
Used Skills
- C++ Programming: Developed in C++ using the Code::Blocks IDE, targeting Debian Linux.
- cURL Package: Uses cURL to download webpages for subsequent analysis.
- Source Code Analysis: Implements functions that analyze the source code of Google's SERP and extract the key HTML elements that affect SEO.
- Automated Keyword Analysis Process: Incorporates functions for automatic analysis of key phrases, enhancing program efficiency and flexibility.
- User-Interactive Interface: Features an interactive user interface in main.cpp, facilitating user engagement and control.
- Customizable Analysis: Allows for easy modification and addition of elements to analyze, providing user-driven customization.
- Comprehensive Data Output: Analysis results are saved as text files and in CSV format for further analysis in spreadsheet applications.
- Data Mining and Analysis: Expertise in extracting, processing, and analyzing large datasets for insightful conclusions.
- Debugging and Problem-Solving: Proficient in identifying and resolving software bugs, ensuring reliable performance.
- Version Control with Git: Efficient management of project versions and collaborative development using Git.
How to Use
The program provides two primary modes of operation, plus options for customizing the analysis and locating the results:
- Batch Analysis: Provide a list of search phrases in the text file "default.txt". When this option is chosen, the program reads the search phrases from the file and analyzes the SERP results for each phrase in turn. This mode is particularly useful for analyzing multiple search queries in one go.
- Single Query Analysis: Alternatively, enter a single search phrase directly in the terminal. The program then analyzes the SERP results for that query only. This mode suits quick, one-off analyses.
- Adding your own elements to the analysis: To analyze additional elements, append them to the "elementy_do_wyszukania.txt" file, starting from the second line. The program automatically includes them in the analysis.
- Saved results: Results of the SERP analysis are saved in "searched_google_sites/przeanalizowane_wyniki_serp.txt". Results of the analysis of individual websites are saved in "ostatnie_analizy_stron/".
Example Use Case
"For instance, if you enter the search query "house in Wrocław," the program will count how many times the words "house" and "Wrocław" appear on web pages within the SERP. It will also determine the placement of each web page in the SERP rankings. Furthermore, the program will analyze the frequency of HTML tags used in the SERP, providing insights into which tags are most commonly employed and how frequently specific keywords should be used within those tags.
The Engineering Aspect for Master's Thesis program empowers users to make informed decisions regarding HTML structure and keyword usage based on real-world SERP data, enhancing the effectiveness of web content optimization strategies."
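The keyword counting in this use case can be sketched as follows. This is a minimal illustration, not the program's implementation: `count_occurrences` and `count_query_words` are hypothetical names, matching is a case-sensitive substring search, and every word of the query is counted (including "in"), since the description does not say how short words are filtered.

```cpp
#include <cstddef>
#include <map>
#include <sstream>
#include <string>

// Counts non-overlapping, case-sensitive occurrences of `word` in `text`.
// Matching is byte-wise, so UTF-8 words such as "Wrocław" match only when
// the page text uses the same encoding and letter case.
std::size_t count_occurrences(const std::string& text, const std::string& word) {
    if (word.empty()) return 0;
    std::size_t count = 0;
    for (std::size_t pos = text.find(word); pos != std::string::npos;
         pos = text.find(word, pos + word.size())) {
        ++count;
    }
    return count;
}

// Splits the query on whitespace and counts each word in the page text.
std::map<std::string, std::size_t> count_query_words(const std::string& text,
                                                     const std::string& query) {
    std::map<std::string, std::size_t> counts;
    std::istringstream words(query);
    std::string word;
    while (words >> word) {
        counts[word] = count_occurrences(text, word);
    }
    return counts;
}
```

Running `count_query_words` on the text of each page in the SERP yields one map per page, which can be stored next to the page's position in the ranking. Because this is a substring search, a short word like "in" also matches inside longer words, so a real analysis would likely need word-boundary checks.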