• 3 Posts
  • 504 Comments
Joined 2 months ago
cake
Cake day: February 13th, 2026

help-circle







  • anyone can scrap Reddit or Lemmy just fine to train on LLM.

    Well, until you hit flood limits and reddit keeps giving you ‘prove you’re not a robot’ screens or just timegates you from loading new pages at all.

    Plus, it would be a lot more convenient to have API access to automatically provide your AI with only the text of posts and not have to scrape/strip entire pages.

    You could scrape reddit without paying for it, but it would probably be a much slower and more annoying process.



  • Needing to run a full-fledged browser in the background in order to display your html/css frontend adds a lot more performance cost than necessary, making the app eat up far more RAM and CPU than necessary. It probably also introduces a lot more security vulnerability concerns that an otherwise simple app shouldn’t have to worry about. And then there’s the dependency chain you’re introducing – now your app needs to be updated every time the underlying browser gets an important update … and maybe needs to be tweaked/rewritten to accommodate that browser update if it changes the way the browser interacts with your app frontend.

    There are plenty of other GUI frontend frameworks that are also expressive, simple, and well-known, without all of these potential problems associated with them.






  • With the resources they have – and the goal of having security and control over their digital supply lines – I don’t see why they wouldn’t roll their own distro. Maybe heavily based on an existing distro (looking at you, Debian) but something that’s completely under their control, so they can choose for themselves exactly what packages are included, what security settings are used as defaults, when and if to update things, etc.


    Edit: after (gasp) actually reading the article, it seems that each department within the French government will be responsible for their own implementation, which means they might all end up using different distros in different departments.