• fleabs@lemmy.world
    English
    arrow-up
    20
    ·
    edit-2
    11 months ago

    You say “simply train,” but really, training these models is the most intensive part. Once they are trained, they require relatively less power to actually run for inference.

    • Corkyskog@sh.itjust.works
      English
      arrow-up
      1
      arrow-down
      1
      ·
      11 months ago

      So it sounds like they need a shitload of GPU power. You know what else costs a shitload of GPU power? Crypto mining. Could they not outsource the work to all those GPUs that stopped mining crypto once it plummeted?

      I am surprised this hasn’t become a community project already. I assume there is some limitation that I am unaware of.

      • pivot_root@lemmy.world
        English
        arrow-up
        3
        ·
        11 months ago

        The limitation is intellectual property. You need the model to train it, and no for-profit company is going to just give that away.

        • Corkyskog@sh.itjust.works
          English
          arrow-up
          1
          ·
          11 months ago

          But they (MS) are planning on doing it either way, so why not crowdsource it and even pay a small pittance for the GPU power? I think it would be popular… there are a lot of sad people with extra GPUs sitting around not being used for much.