Llama 2 13b Chat Norwegian

Llama-2-13b-chat-norwegian is a variant of Meta's Llama 2 13b Chat model, fine-tuned on a mix of Norwegian datasets. It was created at Ruter AI Lab in the summer of 2023.

The model is tuned to understand and generate text in Norwegian. It was trained for one epoch on norwegian-alpaca plus 15,000 samples of machine-translated data from OpenOrca. A small subset of custom-made instructional data is also included.

For other versions of this model see:

Data

  • Norwegian alpaca
  • 15k Norwegian OpenOrca (to be released)
  • Small subset of custom-made instructional data

Intended Use

This model is intended for commercial and research use in Norwegian and can be used for assistant-like chat.

Prompt Template

Llama2 Chat uses a new prompt format:

<s>[INST] <<SYS>>
You are a helpful, respectful and honest assistant. Always answer as helpfully as possible, while being safe. Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. Please ensure that your responses are socially unbiased and positive in nature.
If a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you don't know the answer to a question, please don't share false information. Please answer in the same language as the user.
<</SYS>>
This is a test question[/INST] This is an answer </s><s>

See also the original implementation here.
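
The template above can be assembled programmatically. The following is a minimal sketch that follows the layout exactly as printed in this card. The names `build_llama2_prompt`, `EXAMPLE_SYSTEM_PROMPT`, and `include_bos` are hypothetical helpers introduced for illustration, and the shortened system prompt is an assumption. Many tokenizers add the `<s>` token themselves, so whether to include it in the string depends on your tokenizer settings.

```python
# Hypothetical helper, not part of any library: builds a single-turn prompt
# in the Llama 2 chat layout shown in this model card.

# Shortened system prompt used only for this example (an assumption).
EXAMPLE_SYSTEM_PROMPT = (
    "You are a helpful, respectful and honest assistant. "
    "Please answer in the same language as the user."
)


def build_llama2_prompt(
    user_message: str,
    system_prompt: str = EXAMPLE_SYSTEM_PROMPT,
    include_bos: bool = True,
) -> str:
    """Return the prompt up to and including [/INST]; the model writes the answer."""
    # Set include_bos=False if your tokenizer already prepends <s>.
    bos = "<s>" if include_bos else ""
    return f"{bos}[INST] <<SYS>>\n{system_prompt}\n<</SYS>>\n{user_message}[/INST]"


if __name__ == "__main__":
    print(build_llama2_prompt("Hva er hovedstaden i Norge?"))
```

The returned string stops at `[/INST]`, so the text the model generates corresponds to the answer part of the template.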

We also implemented the Alpaca prompt format, which the model supports:

### Instruction:
Summarize following text.
### Input:
Text to be summarized
### Response:
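
As a rough illustration, the Alpaca layout above could be produced with a small helper like the one below. The function name `build_alpaca_prompt` is hypothetical. Leaving out the `### Input:` section when no input is given is an assumption based on common Alpaca usage, not something this card specifies.

```python
# Hypothetical helper, not part of any library: builds a prompt in the
# Alpaca layout shown in this model card.

def build_alpaca_prompt(instruction: str, input_text: str = "") -> str:
    """Return an Alpaca-style prompt that ends right after '### Response:'."""
    parts = ["### Instruction:", instruction]
    # Assumption: the Input section is dropped when there is no input text.
    if input_text:
        parts += ["### Input:", input_text]
    parts.append("### Response:")
    return "\n".join(parts) + "\n"


if __name__ == "__main__":
    print(build_alpaca_prompt("Summarize following text.", "Text to be summarized"))
```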

Why this model?

As a Norwegian company, we understand firsthand the pressing need for powerful language models tailored to specific languages. Our primary focus is on the Norwegian linguistic landscape. In the age of digitization, languages that lack robust, open-source models risk becoming marginalized. This is why we're introducing this open-source Norwegian model. We believe that by making such resources freely accessible, we can democratize information, foster innovation, and create a more inclusive digital ecosystem. Our aspiration is for this model to serve as a foundational resource for future specialized Norwegian models. Ultimately, our goal is to bolster the Norwegian NLP community and facilitate the smoother integration of Norwegian models into diverse projects.

Limitations

  • This is an LLM, not a knowledge model. It cannot be expected to have more information about Norway than the base model.
  • It will generally perform better on tasks that involve summarization, question answering, and chat than on tasks that require more knowledge about Norway, specific domains, or tasks where the model can answer freely.
  • The data used for training is machine-translated and may contain grammatical and other errors.
  • The model is released as is and will in most cases need prompt tuning to achieve optimal results.

License

Llama 2 is licensed under the LLAMA 2 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved. See the original model card for more information.

From norwegian-alpaca we also note that "the current version uses OpenAI's gpt-3.5-turbo; hence, this dataset cannot be used to create models that compete in any way against OpenAI."

Disclaimer

  • The model is available "as is". Ruter AS takes no responsibility for further use.
  • During testing, the safeguards implemented by Meta appeared to still work as expected in this model. However, we want to point to the Ethical Considerations and Limitations from the original model card:
Llama 2 is a new technology that carries risks with use. Testing conducted to date has been in English, and has not covered, nor could it cover all scenarios.
For these reasons, as with all LLMs, Llama 2’s potential outputs cannot be predicted in advance, and the model may in some instances produce inaccurate, biased or other objectionable responses to user prompts.
Therefore, before deploying any applications of Llama 2, developers should perform safety testing and tuning tailored to their specific applications of the model.
Please see the Responsible Use Guide available at https://ai.meta.com/llama/responsible-use-guide/

Credits

This model was made at Ruter's AI Lab, which is part of Ruter's Data & AI division.


Llama 2 13b Chat Norwegian (translated from Norwegian)

Llama-2-13b-chat-norwegian is a version of Meta's Llama 2 13b Chat model, fine-tuned on a combination of various Norwegian datasets. The model was created at Ruter AI Lab in 2023.

The model is fine-tuned to understand and generate text in Norwegian. It was trained for one epoch on norwegian-alpaca plus a selection of 15,000 machine-translated samples from OpenOrca. The training data also includes a small set of custom-made instructional data.

Other versions of the model:

Data

  • Norwegian alpaca
  • 15k Norwegian OpenOrca (awaiting release)
  • Small set of custom-made instructional data

Prompt Template

Llama2 Chat uses a new prompt format:

<s>[INST] <<SYS>>
You are a helpful, respectful and honest assistant. Always answer as helpfully as possible, while being safe. Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. Please ensure that your responses are socially unbiased and positive in nature.
If a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you don't know the answer to a question, please don't share false information. Please answer in the same language as the user.
<</SYS>>
This is a test question[/INST] This is an answer </s><s>

See the original implementation here.

We have also implemented the Alpaca prompt format, which the model also supports.

### Instruction:
Summarize following text.
### Input:
Text to be summarized
### Response:

Why this model?

As a Norwegian company, we understand firsthand the pressing need for powerful language models tailored to specific languages. Our primary focus is on the Norwegian language area. In the digital age, languages that lack robust open-source models risk being marginalized. This is why we are now introducing this open-source model for Norwegian. We believe that by making these resources freely available, we can democratize information, foster innovation, and create a more inclusive digital ecosystem. Our ambition is for this model to serve as a foundational resource for future specialized Norwegian models. Our goal is to strengthen the Norwegian NLP community and make it easier to incorporate Norwegian models into various projects.

Limitations

  • This is an LLM, not a knowledge model. It cannot be expected to have more information about Norway than the base model.
  • It will generally perform better on tasks that involve summarization, question answering, and chat than on tasks that require more knowledge about Norway, specific domains, or tasks where the model can answer freely.
  • The data used for training is machine-translated and may contain grammatical errors. We have only done a quick manual check of the data.
  • The model is released as is and will in most cases need prompt tuning to achieve the desired results.

License

Llama 2 is licensed under the LLAMA 2 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved. See the original model card for more information.

From norwegian-alpaca, we would like to point out that "the current version uses OpenAI's gpt-3.5-turbo; hence, this dataset cannot be used to create models that compete in any way against OpenAI."

Disclaimer

  • The model is made available "as is". Ruter AS takes no responsibility for further use.
  • During testing, the safeguards implemented by Meta appeared to still work as expected for this model. However, we draw attention to the ethical considerations and limitations from the original model card:
Llama 2 is a new technology that carries risks with use. Testing conducted to date has been in English, and has not covered, nor could it cover all scenarios.
For these reasons, as with all LLMs, Llama 2’s potential outputs cannot be predicted in advance, and the model may in some instances produce inaccurate, biased or other objectionable responses to user prompts.
Therefore, before deploying any applications of Llama 2, developers should perform safety testing and tuning tailored to their specific applications of the model.
Please see the Responsible Use Guide available at https://ai.meta.com/llama/responsible-use-guide/