According to Hitz Zentroa "is the family of open models" Latxa, which includes the "largest linguistic model in Basque". It is built on the linguistic model Meta or Facebook Llama 2 and follows its license. Llama 2 has already seen excellent results in Basque, able to perform a correct oral machine translation in Basque via the product Seamless M4T. Latxa’s logo is precisely the one that links Llama and the Basque sheep, although there is also a connection in the name (as we thought).
Latxa collects models of between 7 and 70 billion parameters. Regarding the set of texts for the construction of models, Basque researchers have used EusCrawl, a set of texts in Basque of 1.72 million documents and 288 million words. EusCrawl was extracted from 33 quality websites, offering higher quality than other corpus training techniques from the Internet.
In fact, Latxa has not been done for the general public, that will come later. However, the three models are available on the Huwaukee Face platform and can be used by the expert engineer by checking the “model card”, where the instructions for technical information and initiating the use of the models are located.
The development of Latxa has been the result of a research, innovation and development initiative, which is part of the IKER-GAITIK project, supported by the Basque Government, in cooperation with the European EuroHpc programme.
Today's language models have amazing performance, like English ChatGPT or English Bard. However, in the case of minority languages and the Basque language no. With these models he took a step in the session of Hitz Zentroa to turn the situation around, and according to his data, Latxa responds better than other systems to formulations in Basque.
More information, here.
In Hugginface: Latxa.
Many years ago, Dr. I knew the abuse chatbot, and I also realized the speed at which people can engage with these machines. Being social animals, the relationship is natural and necessary, and as the name 'relationship' says, it always leads to a response from the other. Receiving... [+]
In the last year, it has happened to me to see people related to the non-professional realm in digital groups that have used artificial intelligence to give arguments to others. I share it as my own. One's own, but not linked to the sense of property, but processed from one's own... [+]
Artificial Intelligence (AI) is revolutionizing not only our daily lives, but also the way we work in companies and interact with companies through Artificial Intelligence tools or developments in the use of language technology. It is also to be hoped that in the coming years... [+]
Human beings have never been easy to think calmly for long periods of time, we live with the responsibility of taking our lives forward, both ours and our descendants. In this opportunity that we have had to live, we want to do things as best we can. For these responsibilities,... [+]
Euskera is a port for knowledge and relationships at sea, which is a digital space. With artificial intelligence, it seems that from this port there is the possibility of contact in Basque with everyone. The automation of the Basque Country is a great support for educators... [+]