Sony Research and AI Singapore (AISG) will collaborate on research for the SEA-LION family of large language models (LLMs).
SEA-LION, which stands for Southeast Asian Languages In One Network, aims to improve the accuracy and capability of AI models when processing languages from the region. This is particularly important given the linguistic diversity of Southeast Asia, which is home to over a thousand different languages.
“As a global company, diversity and localisation are vital forces,” said Hiroaki Kitano, President of Sony Research. “In Southeast Asia specifically, there are more than a thousand different languages spoken by the citizens of the region. This linguistic diversity underscores the importance of ensuring AI models and tools are designed to support the needs of all populations around the world.”
The collaboration will focus on testing and improving the SEA-LION model, with a particular emphasis on Tamil, a language spoken by an estimated 60-85 million people worldwide. Sony Research will leverage its expertise in Indian languages – including Tamil – and its research in speech generation, content analysis, and recognition.
“Access to LLMs that address the global landscape of language and culture has been a barrier to driving research and developing new technologies that are representative and equitable for the global populations we serve,” Kitano added.
The collaboration is further strengthened by Kitano’s existing ties to the Singaporean technology landscape. He holds positions on numerous advisory councils and boards in the country, including the Advisory Council on the Ethical Use of AI and Data, the Infocomm Media Development Authority (IMDA), the Singapore Economic Development Board (EDB), and the National Research Foundation, Singapore (NRF).
“The integration of the SEA-LION model, with its Tamil language capabilities, holds great potential to boost the performance of new solutions,” Teo continued. “We are particularly eager to contribute to the testing and refinement of the SEA-LION models for Tamil and other Southeast Asian languages, while also sharing our expertise and best practices in LLM development.
“We look forward to seeing how this collaboration will drive innovation in multilingual AI technologies.”