██▓▒­░⡷⠂𝚝𝚑𝚎 𝚘𝚌𝚝𝚝 𝚒𝚗𝚜𝚒𝚍𝚎 𝚞𝚛 𝚠𝚊𝚕𝚕𝚜⠐⢾░▒▓██ on 2024-07-18T08:56:28Z [JSON]

YaCy, free software for your own search engine:

A tip when crawling is to always disable "Accept URLs with query-part ('?')" for any website, except when totally necessary (eg. phpBB forums, which write the topic id in the query part), otherwise effectively duplicate pages will be enlisted for the majority of sites, degrading the search.

There's also the Searchlab portal which I don't exactly understand: