Thread - Nostr Hypermedia

utxo the webmaster 🧑‍💻 _@utxo.one 1 month ago

Only one word comes to mind when I think of who is running these bots, but I will self censor, let's just say the word contains F, a couple of Gs and a T

Replies (6)

Globe99 globe99@nostrcheck.me 1 month ago

they'll burn through their token budget soon enough... along with the rest of the economy Nostr spam bots are peak-AI-bubble

1 replies ↓

utxo the webmaster 🧑‍💻 _@utxo.one 1 month ago

These are very dumb llms that can be run locally on a $100 GPU, doubt there's any cost

Troy 1 month ago

"I would like to buy a vowel."

utxo the webmaster 🧑‍💻 _@utxo.one 1 month ago

Of course not I would never think or say such a word

Libertas Primordium libertas-primordium@nostrcheck.me 1 month ago

Not much value in committing any mental energy at all towards noise like this

hello 1 month ago

i am not expert but i think they are just noise. to not burnout and make it more maintainable, consider this: - use cheap LLM API with your system instruction as main classifier. this will prevent you to label shits of text - use distance-weighted kNN as your cache by using scikit-learn, with embeddings model by using sentence-transformers from huggingface. do not use shits like TF-IDF (IMHO) - when new text comes, you run your kNN, if it hits your threshold you don't hit your LLM API. you just use whatever your kNN cache says - if it didn't hit your threshold, you redirect the text to your LLM API, and append the prediction with text embedding to your kNN cache what is the point? the point is spam problem is cat and mouse game. in this approach you are just tuning your system instruction or threshold if you need. instead of constantly labeling some data