Automatically translated from Basque, translation may contain errors. More information here. Elhuyarren itzultzaile automatikoaren logoa

Google Translate: 24 more languages with a new model

  • Google Translate incorporates 24 languages into its automatic translation system. There are languages like the Guarani, the Aimara, the Bambara or the ewe of Ghana that colonialism crushed but did not kill.
Artikulu hau CC BY-SA 3.0 lizentziari esker ekarri dugu.

16 May 2022 - 09:06

Technically, it has marked a new milestone with this increase in Google, as explained in this note. The translation capacity of these languages has been reached through the use of Zero-Shot Machine Translation, based on artificial intelligence, and is characterized by the functioning of this system without the use of bilingual corpus. That is, Google (say) has managed to build the model of that language using only Aymara texts and enable it for translations.

We found this model related to the work and thesis of the Basque computer scientist Mikel Artetxe (Unsupervised Machine Translation), where he developed an automatic translation procedure for minority languages without bilingual corpus. Artetxe now works in the artificial intelligence branch of Facebook-Meta, not Google.

We've tried, translating a text from the Google ad into Aimara language, and then into Euskera. Here are the screen images:

A phrase has been a little special in Basque, "if you want to help the night in the next update...", but, finally, so much.

We have seen that the new languages are already in https://translate.google.es/, but not in the lower Translate window integrated into the search engine home page. The added languages are:

Here are the new languages added by Google Translate:

  • Assam, 25 million speakers in India.
  • Aimara, 2 million speakers, mainly in Bolivia.
  • Bambara, 14 million speakers in Mali and Senegal.
  • Bhojpuri, 50 million speakers in India, Nepal and the diaspora.
  • Maldivera or Dhivehi, 300,000 speakers, national language of the Maldives.
  • Dogri, 3 million speakers in India and Pakistan.
  • Ewe, 7 million speakers in Togo and Ghana.
  • Guarani, 7 million speakers, indigenous and national language of Paraguay.
  • Ilocano, 10 million speakers in the northern Philippines.
  • Konkanera, 2 million speakers, in India, around Goa.
  • The Creole Child, the main language of Sierra Leone.
  • Kurduera (Sorani variant), 15 million speakers in Iraq and Iran.
  • Lingal, 45 million speakers, the main language of Congo, also speaking in neighbouring countries.
  • Luganda, 20 million speakers in Uganda and Rwanda.
  • Love, 34 million speakers in India.
  • Manipurera, 2 million speakers in India.
  • Look, 830,000 speakers in India.
  • 37 million speakers in Ethiopia and Kenya.
  • Quechua, 10 million speakers in Peru, in the Andes in general and in the diaspora.
  • Sanscritic, the ancient classical language of India (its "Latin"), which can contain up to 20,000 speakers.
  • Sepedi or pediera, 14 million speakers in South Africa.
  • Tigrinya, 8 million speakers in Eritrea and Ethiopia.
  • Tsonga, 7 million speakers in South Africa and neighbouring countries.
  • Twi, 11 million speakers in Ghana.

Euskera has been on Google Translate for 12 years, which was added in 2010. Then it had a reasonable quality, but then it has improved a lot, but we believe that tools like Elia.eus or Batua.eus, created in Euskal Herria, are better than Google.


You are interested in the channel: Google
2024-09-16 | Sustatu
The Basque Country Department of Education signed an agreement for the use of Google Workspace for Education without proper compliance with data protection
It is summer news, which appeared in several places, in which the Basque Data Protection Authority (AVPD) opened a file to the Basque Government’s Department of Education for violating the data protection of students by forcing them to use some Google products. We've been... [+]

Europe imposes a fine of EUR 15 billion on multinationals Apple and Google
The European Court of Justice (ECJ) has sentenced the Apple and Ireland case and has ratified the conviction of Google for abuse of a dominant position.

2024-08-19 | El Salto-Hordago
Amazon, the vast Israeli army information store
AWS is the Amazon cloud and is being a key factor in Israel’s military operations.

ChatGPT also knows that data centers will steal millions of liters of water from us.
By pooling artificial intelligence, tech multinationals have multiplied their plans to build large cloud data centers. The mega-fabric ecological footprint with computer equipment is huge: in addition to electricity, they need millions of liters of water to cool their systems... [+]

How to disable the tracking that Google has inserted in Chrome
For years Firefox and Safari have blocked third-party cookies that are used to track. Chrome will do the same in 2025, but they want to leverage to increase their leadership in the advertising business by integrating their tracking into their own browser. The NOYB Group for... [+]

2024-06-07 | Sustatu
Google’s results have shown some reasons for Catalan to appear less
This website documented the fall of Catalan and Basque between 2022-2023 in the search results of Google. Something that had some repair. Recently, from some studies that have come from Catalonia, we have known some reasons for these languages to appear less in the searches. On... [+]

2024-03-22 | Sustatu
Google receives a fine of EUR 250 million in France
The French Competition Authority imposes a fine of EUR 250 million on Google for serious infringements of market competition for actions in the digital advertising sector.

It can be used in Euskera Matomo, the free alternative of Google Analytics
Website managers want to analyze the traffic of their visitors, but the most used Google Analytics tool has privacy and legal problems. Iametza has translated to Euskera the free software Matomo most used as a more ethical alternative.

Denmark considers it illegal for Google to collect data from students
As in many schools in Euskal Herria, Denmark uses Chromebooks and Google Workspace (formerly G Suite for Education). However, it seems that from the next course things are changing, as the Danish data protection agency has considered the use of Google’s personal data to be... [+]

2024-01-24 | David Lindemann
Lift your head and fist

I think Soraluze’s ‘Jaso burua’ initiative is very good, which drives the necessary awareness. Last week, I was at the conference for parents of elementary school children. With the older child in FP, little criticism was heard about the use of screens (and screen... [+]


Governments use mobile communications for espionage, according to an American senator
U.S. Senator Ron Wyden announces a hitherto unknown method of spiking: mobile notifications. U.S. and other state governments are using this technique, Reuters has announced.

Google improves your search results in Basque to achieve a "better treatment of languages"
The Puneus foundation denounced in July that Google's searches discriminated against the Basque country, as only half of the searches were for the Basque country in the first place. In September, 73% of the results have prioritized Euskera.

2023-07-12 | Sustatu
The PuntuEus Observatory confirms with data the damage caused by Google in searches in Euskera
The PuntuEus Foundation Observatory released the measured data on Tuesday in 2022 at a press conference. The event was attended by Albert Cuesta, coordinator of the Aliança per la Presència Digital del Català and Luistxo Fernández, president of the Cultural Association of... [+]

2023-06-09 | Sustatu
A Catalan report shows the change in trend from Google to Spanish
It is known (we have already commented in Sustatu in December and January) that Google discriminates against the results in Basque and Catalan since last year. He promised the Catalans that Google would study the matter. There seems to be no change. On the contrary, the Catalans... [+]

Free and open websites 30 years
It is 30 years since the CERN Physics Research Center made available to everyone the World Wide Web world network or web pages. On 30 April 1993, the software needed to create, host and visit websites was made public, starting a new phase.

Eguneraketa berriak daude