Pixelologist@lemmy.world to Reddit@lemmy.ml • The Verge reacting to Reddit's spokesperson trying to discredit them • English
1 year ago
Please correct me if I’m mistaken, but isn’t the Reddit dataset used to train LLMs from before ChatGPT became widely known? I was under the impression that data from that point onwards was poisoned and not useful for training purposes.
I can’t seem to find it now, but I remember there being a ~90 GB .zip megadb upload that got passed around a lot on machine learning subreddits; it was a snapshot of Reddit from before a certain date.
I use USB-B all the time (fightsticks) and still call them printer cables lmao