• narwhal@lemmy.ml (OP) · 2 upvotes · 9 months ago

    What about preserving languages that are close to extinct, but still have language data available? Can LLMs help in this case?

    • ImpossibilityBox@lemmy.world · 5 upvotes · 9 months ago

      For preservation only, and probably no better than a historical linguist would do.

      But it gets tricky because LLMs only function on HUGE sets of data. LLMs are nothing more than complicated probability engines. Give one the question “What color is the sky?” and the math extracted from the massive datasets it was trained on says the highest-probability answer is “Blue”. It doesn’t actually KNOW the answer; it just knows the probabilities of different words.

      Without large amounts of data on the dying language, current-gen LLMs won’t be accurate or able to generate reliable answers. Shoot… LLMs can barely generate reliable answers with the massive datasets they currently have.
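To make the "probability engine" idea concrete, here is a toy sketch in Python. It is not how a real LLM works internally (those use neural networks trained on vastly more text); it just counts which word follows a prompt in a tiny made-up corpus and picks the most frequent one. The corpus and the function name `next_word_probabilities` are invented for illustration.

```python
from collections import Counter

# Hypothetical "training data": a handful of made-up sentences.
corpus = [
    "the sky is blue",
    "the sky is blue",
    "the sky is blue",
    "the sky is grey",
    "the sky is orange",
]


def next_word_probabilities(prompt, sentences):
    """Estimate P(next word | prompt) by counting continuations in the corpus."""
    prompt_words = prompt.split()
    n = len(prompt_words)
    counts = Counter()
    for sentence in sentences:
        words = sentence.split()
        # Only count sentences that start with the prompt and have a next word.
        if words[:n] == prompt_words and len(words) > n:
            counts[words[n]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}  # the corpus has nothing to say about this prompt
    return {word: count / total for word, count in counts.items()}


probs = next_word_probabilities("the sky is", corpus)
best = max(probs, key=probs.get)  # the highest-probability continuation
print(probs)
print(best)

# A prompt the corpus never saw gives the model nothing to work with.
print(next_word_probabilities("the ocean is", corpus))
```

The model "answers" blue only because that word follows the prompt most often in the data, not because it knows anything about skies. For a prompt the corpus never covered, it has no probabilities at all, which is a crude version of the small-dataset problem for a dying language.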

      I strongly recommend that anyone even remotely interested in LLMs read this interactive article:

      https://ig.ft.com/generative-ai/