On Tuesday, Meta announced Llama 2, a new open source family of AI language models notable for its commercial license, which means the models can be integrated into commercial products, unlike its predecessor. They range in size from 7 to 70 billion parameters and reportedly "outperform open source chat models on most benchmarks we tested," according to Meta.
"This is going to change the landscape of the LLM market," tweeted Chief AI Scientist Yann LeCun. "Llama-v2 is available on Microsoft Azure and will be available on AWS, Hugging Face, and other providers."
According to Meta, its Llama 2 "pretrained" models (the bare-bones models) are trained on 2 trillion tokens and have a context window of 4,096 tokens (fragments of words). The context window determines the length of the content the model can process at once. Meta also says that the Llama 2 fine-tuned models, developed for chat applications similar to ChatGPT, have been trained on "over 1 million human annotations."
Read 10 remaining paragraphs | Comments
https://ift.tt/NByfIhC
Comments