Commit message | Author | Age | Files | Lines
...
* Crawler: Normalise relative urls | Baitinq | 2022-10-20 | 1 | -2/+17
  We now normalise urls starting with / (relative to root) and // (relative to protocol)
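
A minimal sketch of the normalisation this commit describes, using only the standard library. The function name `normalise_url` and its parameters are hypothetical and not taken from the repository; the sketch assumes the page URL is absolute and leaves any other kind of href unchanged.

```rust
/// Hypothetical helper: resolve a parsed href against the page it was found on.
/// `page_url` is assumed to be an absolute URL such as "https://example.com/a/b".
fn normalise_url(page_url: &str, href: &str) -> String {
    // Split "https://example.com/a/b" into the scheme and everything after "://".
    let (scheme, rest) = match page_url.split_once("://") {
        Some(parts) => parts,
        None => return href.to_string(), // not an absolute page URL; leave href untouched
    };
    // The host is everything up to the first '/' after the scheme.
    let host = rest.split('/').next().unwrap_or(rest);

    if let Some(stripped) = href.strip_prefix("//") {
        // Relative to protocol: reuse the page's scheme.
        format!("{scheme}://{stripped}")
    } else if href.starts_with('/') {
        // Relative to root: reuse the page's scheme and host.
        format!("{scheme}://{host}{href}")
    } else {
        // Anything else is returned unchanged in this sketch.
        href.to_string()
    }
}
```

Under these assumptions, calling `normalise_url("https://example.com/a/b", "/about")` would yield `https://example.com/about`. The `//` check has to come first, because a protocol-relative href also starts with `/`.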
* Crawler: Remove duplicate parsed urls | Baitinq | 2022-10-20 | 3 | -0/+20
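
One way to remove duplicate parsed urls, sketched with a `HashSet`. The commit does not say how the deduplication is done, so the helper name `dedup_urls` and the choice to keep first-seen order are assumptions.

```rust
use std::collections::HashSet;

/// Hypothetical helper: drop repeated URLs while keeping first-seen order.
fn dedup_urls(urls: Vec<String>) -> Vec<String> {
    let mut seen = HashSet::new();
    urls.into_iter()
        // `insert` returns false when the value was already present.
        .filter(|url| seen.insert(url.clone()))
        .collect()
}
```

Sorting and calling `Vec::dedup` would also work, at the cost of losing the order in which links appeared on the page.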
* Crawler: Add basic html parsing and link-following | Baitinq | 2022-10-20 | 3 | -9/+1561
  Extremely basic implementation. Needs max queue size, error handling, formatting of parsed links.
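
To illustrate the link-extraction step, here is a deliberately naive sketch that scans for double-quoted `href` attributes with plain string search. The log does not show how the commit parses html (the size of the diff suggests a dependency was added), so `extract_links` is a hypothetical stand-in, not the crawler's code.

```rust
/// Naive, hypothetical link extractor: collects the values of
/// double-quoted href attributes in document order.
fn extract_links(html: &str) -> Vec<String> {
    let needle = "href=\"";
    let mut links = Vec::new();
    let mut rest = html;
    while let Some(start) = rest.find(needle) {
        // Everything after the opening quote of the attribute value.
        let after = &rest[start + needle.len()..];
        match after.find('"') {
            Some(end) => {
                links.push(after[..end].to_string());
                // Continue scanning after the closing quote.
                rest = &after[end + 1..];
            }
            None => break, // unterminated attribute; stop scanning
        }
    }
    links
}
```

The extracted values are returned raw, so relative links such as `/about` still need formatting before they can be followed.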
* Crawler: Add skeleton crawler implementation | Baitinq | 2022-10-20 | 4 | -0/+1051
  Starts by filling a queue with the top 1000 most visited sites. "Crawls" each one (empty fn), and blocks for new elements on the queue.
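
The skeleton described here can be sketched with a standard-library channel acting as the queue. All names (`crawl_url`, `run_crawler`, `seeded_queue`) are hypothetical, and the use of `std::sync::mpsc` is an assumption; the log does not show which queue type the commit uses.

```rust
use std::sync::mpsc::{channel, Receiver};

/// Placeholder for the real work; the commit describes this step as an empty fn.
fn crawl_url(_url: &str) {}

/// Hypothetical crawl loop: blocks on the queue until a URL arrives, and
/// returns the number of URLs processed once every sender has been dropped.
fn run_crawler(queue: Receiver<String>) -> usize {
    let mut crawled = 0;
    // Iterating a Receiver blocks on each step while waiting for the next element.
    for url in queue {
        crawl_url(&url);
        crawled += 1;
    }
    crawled
}

/// Seed a queue with an initial list of sites
/// (the commit uses the top 1000 most visited sites).
fn seeded_queue(sites: &[&str]) -> Receiver<String> {
    let (tx, rx) = channel();
    for site in sites {
        tx.send(site.to_string()).expect("receiver is still alive");
    }
    rx // `tx` is dropped here, so the crawl loop ends once the queue drains
}
```

In this sketch the loop terminates because the only sender is dropped after seeding. A crawler that keeps a sender alive to enqueue newly found links would instead block indefinitely when the queue is empty, which matches the behaviour the commit message describes.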
* Misc: Change to use "oxalica/rust-overlay" for the nix development shell | Baitinq | 2022-10-19 | 3 | -26/+90
  This fixes vscode not being able to find rust-analyzer and rust-src
* Misc: Separate OSSE into components | Baitinq | 2022-10-19 | 9 | -10/+56
  We now have a cargo workspace with the Crawler, Client and Indexer packages.
* Initial Commit! | Baitinq | 2022-10-19 | 10 | -0/+136
  This is the initial commit for this experiment of a search engine. I hope I can learn a lot from this!