We implemented a system that applies various algorithms learned in the Discrete Structures course to a real search system to provide personalized search results.
This project collects Google search history data through web crawling and analyzes the frequency of user-input keywords to provide personalized search results.
Extracting search data
via web crawling
Keyword frequency analysis
and pattern recognition
Output of prioritized
personalized results
Represents web pages and hyperlinks as nodes and edges to analyze data relationships and perform efficient web crawling.
Uses AND, OR, and NOT operators to refine queries and provide highly relevant results.
Analyzes the meaning of sentences and extracts sentences containing specific keywords.
Tracks keyword frequency to set priorities and enhance the efficiency of search results.
Efficiently finds sentences containing user-input keywords from collected text.
Sorts filtered words to present results in a more user-friendly way.
Save/Load keywords
Collect Google search results
Track keyword frequency
Select highly relevant information
Saves user-input keywords to a file and loads previously entered keywords at program execution to support continuous learning.
Based on user-input keywords, automatically collects suggested queries and sentences containing the keywords from Google search results.
Tracks the frequency of input keywords. If a keyword is entered beyond a certain threshold, it is prioritized to enhance personalization.
Extracts words from collected sentences and filters only those containing specific keywords to output highly relevant results.
Successfully applied six algorithms learned in the Discrete Structures course to a real search system.
Significantly enhanced the accuracy and relevance of search results through keyword frequency tracking and personalization features.
Six team members completed the project through effective collaboration, each contributing in their area of expertise.
Implemented a fully functional search system using Python and provided accessibility through a QR code.