• ImpossibilityBox@lemmy.world
    link
    fedilink
    arrow-up
    5
    ·
    9 months ago

    Preservation only but not likely any better than a linguistic historian.

    But it gets tricky because LLMs only function on HUGE sets of data. LLMs are nothing more than complicated probability engines. Give it the question “What color is the sky?” and the math extracted from the massive databases that it has says the highest probability answer is “Blue”. It doesn’t actually KNOW the answer it just knows the probabilities of different words.

    Without large amounts of data on the dying language current gen LLM’s won’t be accurate or able to generate reliable answers. Shoot… LLMs can barely generate reliable answers with the massive datasets they currently have.

    I strongly recommend anyone even remotely interested in LLMs to read this interactive article:

    https://ig.ft.com/generative-ai/