Luu Tuyen@lemmy.world to Technology@lemmy.worldEnglish · 9 个月前TikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAIfortune.comexternal-linkmessage-square128fedilinkarrow-up1569arrow-down17
arrow-up1562arrow-down1external-linkTikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAIfortune.comLuu Tuyen@lemmy.world to Technology@lemmy.worldEnglish · 9 个月前message-square128fedilink
minus-squarechickenf622@sh.itjust.workslinkfedilinkEnglisharrow-up18·9 个月前Like those platforms aren’t already full of AI garbage as well. Training new models will require a cut-off date before the genie was let out of the bottle.
minus-squareDrunemeton@lemmy.worldlinkfedilinkEnglisharrow-up4·9 个月前I think that’s the “25-times faster” bit. They seem to be in a hurry to collect as much human-generated data as possible.
minus-squareGHiLA@sh.itjust.workslinkfedilinkEnglisharrow-up4·9 个月前How does it know what is and isn’t? Uh oh.
minus-squareDrunemeton@lemmy.worldlinkfedilinkEnglisharrow-up5·9 个月前Yeah… Hey! Perhaps they’ll use A.I. to weed out the A.I. generated bits.
minus-squareJackbyDev@programming.devlinkfedilinkEnglisharrow-up1·9 个月前I mean, if I could theoretically take a snapshot of the entire Internet I’d rather do it now than later because there’s just gonna be more AI later.
deleted by creator
Like those platforms aren’t already full of AI garbage as well. Training new models will require a cut-off date before the genie was let out of the bottle.
I think that’s the “25-times faster” bit. They seem to be in a hurry to collect as much human-generated data as possible.
How does it know what is and isn’t?
Uh oh.
Yeah…
Hey! Perhaps they’ll use A.I. to weed out the A.I. generated bits.
I mean, if I could theoretically take a snapshot of the entire Internet I’d rather do it now than later because there’s just gonna be more AI later.