Is there anyway to make it use less at it gets more advanced or will there be huge power plants just dedicated to AI all over the world soon?

  • hisao@ani.social
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 months ago

    So do they load all those matrices (totalling to 175b params in this case) to available GPUs for every token of every user?