I’m just a nerd girl.

  • 1 Post
  • 128 Comments
Joined 1 year ago
Cake day: March 4th, 2024








  • Rose@lemmy.world to Technology@lemmy.world, *Permanently Deleted* (4 months ago, 62 upvotes)

    I have no idea why the makers of LLM crawlers think it’s a good idea to ignore bot rules. The rules are there for a reason and the reasons are often more complex than “well, we just don’t want you to do that”. They’re usually more like “why would you even do that?”

    Ultimately you have to trust what the site owners say. The reason why, say, your favourite search engine returns the relevant Wikipedia pages and not a bazillion random old page revisions from ages ago is that Wikipedia said “please crawl the most recent versions using canonical page names, and do not follow the links to the technical pages (including history)”. Again: Why would anyone index those?
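For anyone wondering what "following bot rules" looks like from the crawler's side, here is a minimal sketch using Python's standard `urllib.robotparser`. The rules, the `example.org` URLs, the `ExampleBot` user agent and the helper names are all made up for illustration; they are not Wikipedia's actual robots.txt, just the same idea in miniature: crawl the article pages, stay out of the history pages.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration only (not any real site's robots.txt):
# every bot may crawl the site, except anything under /history/.
ROBOTS_TXT = """\
User-agent: *
Disallow: /history/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def allowed(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the rules let this user agent fetch the URL."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(ROBOTS_TXT)

# A polite crawler asks before every request and skips what is disallowed.
for url in (
    "https://example.org/wiki/Main_Page",
    "https://example.org/history/Main_Page/old-revision",
):
    verdict = "fetch" if allowed(parser, "ExampleBot", url) else "skip"
    print(verdict, url)
```

A real crawler would download the site's own robots.txt first (for example with `RobotFileParser.set_url()` and `read()`) instead of hard-coding the rules, but the check before each request stays the same.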





  • Oh yeah, one of the pics that inspired me to study French. I was dreading the numerals but it’s not that bad. You count tens and twenties and sometimes they’re special. And numbers below 20 have specific names, but that’s kinda true in most languages.

    A lot of languages have weird corner cases. (Like, in Finnish most numbers are perfectly regular. Except 11-19 which are not “one-ten-and-x” but rather “x-of-the-second”. I’m sure there’s a reasonable etymological reason. At least they’re not “teens”.)






  • I run ad blockers. As a security measure. Ad companies collect an insane amount of data and do a bunch of shady stuff whenever they can get away with it.

    I want to support websites whenever I’m able, but the way ad companies operate just ain’t it.

    If they clean up their act, maybe then I could stop using ad blockers, but it’s been decades and I don’t have high hopes.

    I also use ad blockers for performance and usability reasons. For example, I used to use a bunch of Fandom wikis and couldn’t understand why people hated the UI. Then I saw what Fandom looks like without ad blockers and holy shit how can humans live like this