Language Grid

Multilingual service platform for intercultural collaboration From Wikipedia, the free encyclopedia

The Language Grid (言語グリッド, Gengo Guriddo) is an online multilingual service platform that integrates language resources such as dictionaries, bilingual corpora, and machine translation systems through web service technologies. It was developed in Japan as a research initiative to facilitate the sharing and composition of language services[1][2]. While it initially started as an experimental infrastructure centered on the National Institute of Information and Communications Technology (NICT) and Kyoto University, after more than 10 years of operation, it has developed into an international experimental network with the participation of organizations from over 24 countries, and the server software has been released as open source.

The platform has been applied in several public-sector projects, including support systems for foreign patients in medical facilities and a multilingual agricultural support program known as the YMC model in rural Vietnam[2][3]. In 2017, the management entity was transferred from Kyoto University to the NPO Language Grid Association[4].

Design Philosophy: From Language Resources to Language Services

The impetus for the Language Grid was the "Intercultural Collaboration Experiment (ICE)" conducted in 2001. In this experiment, joint software development using machine translation was carried out between universities in Japan, China, South Korea, and Malaysia. However, conversations were often interrupted because existing translation engines could not handle specialized terminology (jargon). The challenge identified was that users had no means to customize their environment by combining dictionaries and translation engines suitable for their own communities.

A platform for sharing multilingual resources and building language environments tailored to user needs.

Based on these experiences, the Language Grid project was launched in 2006 under the slogan "from language resources to language services." To improve the accessibility and usability of language resources, a service-oriented approach (SOA) was proposed, in which resources are deployed as Web services on the Internet. By wrapping language resources as "atomic services," they can be combined to construct new "composite services." This allows users to configure customized multilingual environments. Experimental operation began in 2007, and the server software has been released as open source since April 2010.

Service Sharing Mechanism

Stakeholders

The Language Grid aimed to improve the usability of language resources while mitigating risks related to intellectual property rights when disclosing language resources. By wrapping resources as services, providers can control their own intellectual property rights. The Language Grid defines stakeholders, their roles, and the social protocols between them[5]. Users of the Language Grid are defined as taking on one or more of the following three roles:

  • Service Providers deploy language resources on the Language Grid as Web services. These are called atomic services. When registering a service, they specify access control policies such as availability and frequency of use. Service providers can also provide workflows for combining atomic services. These are called composite services. The above atomic services and composite services are collectively referred to as language services.
  • Service Consumers use language resources by invoking atomic services. When a service consumer invokes a composite Web service, the invocation request is sent to a workflow engine, and the workflow engine executes a workflow that combines one or more atomic language services.
  • Grid Operators manage and control the Language Grid (service grid) for service providers and consumers. A grid operator exists for each service grid.
A hierarchical structure based on service-oriented architecture (SOA).

Protection of Intellectual Property Rights

The roles of the Language Grid stakeholders are explicitly stated in the memorandum of understanding[5]. For service providers, the protection of intellectual property rights is extremely important. To meet this requirement, the purposes of service use are classified into the following three types:

  • Non-profit use: The language service is used for public or non-profit purposes.
  • Research use: The language service is used with the intent of advancing a specialized field. It is not intended for commercial profit.
  • For-profit use: The language service is used for commercial profit.

The above classification of use purposes is independent of the form of the organization, such as for-profit or non-profit. For example, Corporate Social Responsibility (CSR) activities are classified as "non-profit use." This is because such activities are often conducted together with public institutions or non-profit organizations. Conversely, even activities of public institutions or non-profit organizations are classified as "for-profit use" if they are for commercial profit.

Server Software

The Language Grid is composed of four service layers: the P2P grid layer (P2P service grid layer), the atomic service layer, the composite service layer, and the application system layer[2][1]. The P2P grid layer is constructed using open-source server software (Service Grid Server Software).

The main components of the server software are the Service Supervisor and the Grid Composer[3].

  • Service Supervisor: Invokes language services in accordance with the access control policies specified by the service provider. Before a service consumer invokes a service on a container, it verifies whether the invocation meets the provider's policy.
  • Grid Composer: Performs interconnection between service grids operated in different regions.
Internal configuration responsible for service management, execution control, and service orchestration.

The Language Grid server software was developed by the Language Grid Project of the National Institute of Information and Communications Technology (NICT). Development began in 2006, enabling more than 20 multilingual translations. It has been maintained as open source software since April 2010[6].

Operation Structure and Governance

Operation Centers

The Language Grid is managed through a federated network of regional operation centers. The Kyoto Operation Center was established in December 2007 by the Department of Social Informatics at the Kyoto University Graduate School of Informatics to operate the platform for non-profit and research purposes[7]. In May 2017, the management of the Kyoto Operation Center was transferred from Kyoto University to the NPO Language Grid Association[4]. According to a 2018 report, 183 groups from 24 countries and regions participated in the center, with 226 services shared through federated operation[1].

Other regional operation centers were established to facilitate international federation:

These four operation centers are interconnected, realizing a federated operation to share language services among multiple service grids. These efforts have been reported as a means to facilitate communication on the Web[8].

Research Organizations

Research on the development and use of the Language Grid spans multiple fields, including artificial intelligence, services computing, and human–computer interaction (HCI).

Since 2006, research funding for the Language Grid has been provided by the National Institute of Information and Communications Technology (NICT), the Japan Society for the Promotion of Science (JSPS), the Research Institute of Science and Technology for Society (RISTEX) of the Japan Science and Technology Agency (JST), and the Strategic Information and Communications R&D Promotion Programme (SCOPE) of the Ministry of Internal Affairs and Communications.

In November 2015, joint research on the federation of language service platforms began among the Language Grid, the European Language Resources Association (ELRA), and the Linguistic Data Consortium (LDC).

Impact and Extensions

The Language Grid has influenced academic research and social activities. These extensions and implementations are conducted independently by various research groups and organizations.

Academic Evolution of the Design Philosophy

The design philosophy of the Language Grid has inspired further academic research, leading to the proposal of new frameworks by independent research groups.

In the Open Multilingual Wordnet project, a multilingual dictionary network, the design philosophy of the Language Grid is cited as a requirement for language resources to be widely disseminated and useful. It states that ensuring "accessibility" through open licensing and improving "usability" through standardized interfaces are essential for building a sustainable language infrastructure[9].

The "Language Application Grid (LAPPS Grid)" in the United States, funded by the National Science Foundation (NSF), adopted the Language Grid's server software with the aim of increasing the interoperability and reusability of natural language processing (NLP) tools[10]. The LAPPS Grid aimed to create an environment where researchers can easily configure and execute complex language analysis pipelines by integrating distributed NLP tools and data onto a common platform.

In Europe, the "European Language Grid (ELG)" project (2019–2022) was implemented, sharing a common design philosophy with these preceding projects[11]. The ELG adopted the service-oriented approach advocated by the Language Grid, such as ensuring interoperability through standardized APIs and building a scalable infrastructure.

Third-party Applications and Use Cases

The Language Grid has been utilized by various third-party organizations and local communities for diverse social activities. These implementations are managed independently of the core Language Grid project. To support these activities, the NPO Language Grid Association[4] was established in 2017 by universities, companies, and NPOs.

Support for Foreign Patients in Hospitals

When foreign tourists fall ill in Japan, they may not be able to receive appropriate treatment due to a lack of communication with Japanese doctors. Therefore, a multilingual medical communication support system was jointly developed by Wakayama University and NPO Kyoto Multicultural Center[12][13].

This NPO dispatched volunteer interpreters to affiliated hospitals approximately 1,700 times a year. The multilingual medical communication support system supported communication between foreign outpatients and medical staff at hospital receptions. Medical staff used this system to ask outpatients about their symptoms and to provide guidance on various hospital sections.

Multilingual Agricultural Support (YMC Model)

From 2011 to 2014, in collaboration with the NPO Pangaea[14] and universities in Japan and Vietnam, a rice cultivation support project was implemented in Vĩnh Long province in the Mekong Delta. The project aimed to improve rice productivity and reduce pesticide use using the Language Grid.

As it was difficult for Japanese agricultural experts to visit the site constantly, advice was provided using the Language Grid. However, there were challenges such as low literacy rates in the target area and farmers' unfamiliarity with computer operations. To solve these, the "Youth Mediated Communication (YMC)" model was devised. In this model, children with IT literacy acted as mediators between agricultural experts and farmers (their parents), playing a role in bridging gaps in language, knowledge, and culture. When machine translation accuracy was insufficient, human volunteers called "bridgers" (mainly agricultural students) performed online translation corrections.

The YMC model attracted attention from local education and agricultural authorities, with its results being announced by the Ho Chi Minh City Department of Education and Training[15]. Additionally, this initiative was featured in major local Vietnamese media, reporting on children supporting agriculture through technology[16][17].

Project Evaluation and Future

Beyond technical development, the Language Grid functions as a social infrastructure (social machine) that supports multicultural coexistence and international exchange. In recognition of its contribution, it was awarded the "Achievement Award" by the Institute of Electronics, Information and Communication Engineers (IEICE) in 2012[18].

The project demonstrated that customized translation environments could be constructed using a service-oriented approach. However, practical implementation, such as the agricultural support project in Vietnam, showed that human intervention was still necessary to ensure sufficient translation quality for specialized fields.

With the emergence of large language models (LLMs), the technical constraints surrounding context, culture, and common sense—which were difficult to address through the simple combination of traditional language services—are being re-evaluated. Consequently, the project's focus has transitioned toward research into "translation agents" that can leverage these new capabilities within the framework of collaborative language services[19].

References

Related Articles

Wikiwand AI