| Age | Commit message | Author | Files | Lines | |
|---|---|---|---|---|---|
| 2022-10-20 | Crawler: Add indexer interaction skeleton | Baitinq | 1 | -1/+5 | |
| 2022-10-20 | Crawler: Wrap crawl response in Result type | Baitinq | 1 | -18/+23 | |
| 2022-10-20 | Crawler: Normalise relative urls | Baitinq | 1 | -2/+17 | |
| We now normalise urls starting with / (relative to root) and // (relative to protocol) | |||||
| 2022-10-20 | Crawler: Remove duplicate parsed urls | Baitinq | 3 | -0/+20 | |
| 2022-10-20 | Crawler: Add basic html parsing and link-following | Baitinq | 3 | -9/+1561 | |
| Extremely basic implementation. Needs max queue size, error handling, formatting of parsed links. | |||||
| 2022-10-20 | Crawler: Add skeleton crawler implementation | Baitinq | 4 | -0/+1051 | |
| Starts by filling a queue with the top 1000 most visited sites. "Crawls" each one (empty fn), and blocks for new elements on the queue. | |||||
| 2022-10-19 | Misc: Change to use "oxalica/rust-overlay" for the nix development shell | Baitinq | 3 | -26/+90 | |
| This fixes vscode not being able to find rust-analyzer and rust-src | |||||
| 2022-10-19 | Misc: Separate OSSE into components | Baitinq | 9 | -10/+56 | |
| We now have a cargo workspace with the Crawler, Client and Indexer packages. | |||||
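
The "Normalise relative urls" and "Remove duplicate parsed urls" commits describe their behaviour only in prose, so here is a minimal, std-only sketch of what such steps could look like. It is not the crawler's actual code: the function names `normalise_url` and `dedup_urls`, their signatures, and the assumption that the page URL has the form `scheme://host/path` are all hypothetical and chosen for illustration.

```rust
use std::collections::HashSet;

/// Hypothetical helper (not from the repository): resolve a parsed link
/// against the URL of the page it was found on.
/// Assumes `page_url` looks like "scheme://host/optional/path".
fn normalise_url(page_url: &str, link: &str) -> Option<String> {
    // Split "https://example.com/a/b" into "https" and "example.com/a/b".
    let (scheme, rest) = page_url.split_once("://")?;
    // The host is everything before the first '/' (or all of `rest`).
    let host = rest.split('/').next()?;

    if let Some(stripped) = link.strip_prefix("//") {
        // Relative to protocol: reuse the page's scheme.
        Some(format!("{}://{}", scheme, stripped))
    } else if link.starts_with('/') {
        // Relative to root: reuse the page's scheme and host.
        Some(format!("{}://{}{}", scheme, host, link))
    } else {
        // Anything else is passed through unchanged in this sketch.
        Some(link.to_string())
    }
}

/// Hypothetical helper: drop repeated URLs while keeping first-seen order.
fn dedup_urls(urls: Vec<String>) -> Vec<String> {
    let mut seen = HashSet::new();
    // `insert` returns false when the value was already present.
    urls.into_iter().filter(|u| seen.insert(u.clone())).collect()
}
```

Under these assumptions, calling `normalise_url("https://example.com/a/b", "/about")` yields `Some("https://example.com/about")`, and a `//`-prefixed link inherits the `https` scheme. A real crawler would more likely lean on a dedicated URL-parsing crate to handle edge cases such as `../` segments, query strings, and fragments, which this sketch ignores.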