4,002 Downloads Updated 1 year ago
Updated 1 year ago
1 year ago
3aa5a6387d3b · 3.8GB
TowerInstruct-7B is a language model that results from fine-tuning TowerBase on the TowerBlocks supervised fine-tuning dataset. TowerInstruct-7B-v0.1 is the first model in the series. The model is trained to handle several translation-related tasks, such as general machine translation (e.g., sentence- and paragraph/document-level translation, terminology-aware translation, context-aware translation), automatic post edition, named-entity recognition, grammatical error correction, and paraphrase generation. We will release more details in the upcoming technical report. For now, you can check the results obtained with the model here.
Update: TowerInstruct-7B-v0.2 has more reliable document-level translation capabilities in comparison with TowerInstruct-7B-v0.1. The new version of TowerBlocks used to train v0.2 is also available in the Tower collection.
Note: TowerInstruct-v0.2 was trained using the ChatML prompt templates without any system prompts.
The model was initially fine-tuned on a filtered and preprocessed supervised fine-tuning dataset (TowerBlocks), which contains a diverse range of data sources: