• 0 Posts
  • 372 Comments
Joined 11 个月前
cake
Cake day: 2023年8月8日

help-circle

  • 4am@lemm.eetothe_dunk_tank@hexbear.netactual twitter ad
    link
    fedilink
    English
    arrow-up
    8
    ·
    16 小时前

    And they’ll try to claim that it’s considered “fair use transformation” under copyright law, the joke ass Supreme Court will hand them a win and the United States will become blanketed in datacenters as a million grifters try to set up their own unnecessary LLMs in order to get illiterate dumbasses to sign up for subscriptions they hope they forget to cancel











  • It does sound to me like ingesting all these different formats into a normalized database (aka data warehousing) and then building your tools to report from that centralized warehouse is the way to go. Your warehouse could also track ingestion dates, original format converted from, etc. and then your tools only need to know that one source of truth.

    Is there any reason not to build this as a two-step process of 1) ingestion to a central database and 2) reporting from said database?