Yeah so here's a NIP for the nostr: easy plagiarism detection
X tag (big X) = simhash of content field
If clients standardize putting the simhash of the content into the X tag, you can query nostr for plagiarized content and it will even work if they changed the content slightly. @Fanfares is doing this already.
Here's a simhash implementation
GM @utxo the webmaster ๐งโ๐ป @Vitor Pamplona @jb55 @fiatjaf
View quoted note โ
GitHub
GitHub - arkin0x/simhash-ts: Convert text into a similarity hash based on sha256. Inspired by https://matpalm.com/resemblance/simhash/
Convert text into a similarity hash based on sha256. Inspired by https://matpalm.com/resemblance/simhash/ - arkin0x/simhash-ts

might try out hermes, looks like what I've been seeking 
