---------- NGA ----------

2 Phase I Selections from the 06.1 Solicitation

(In Topic Number Order)
9500 Innovation Drive
Manassas, VA 20110
(703) 368-6107
Mr. Daniel Mekonnen
NGA 06-001      Awarded: 02JUN06
Title:Modularized Transliteration Engine
Abstract:Applications today that serve transliteration provide only a limited number of internally supported systems. The information communities coping with toponymic issues must be able to easily add new systems to a transliteration engine, extend or update an already supported system. These additions, updates and extensions must also be readily shared within a community of interest utilizing the GIG. Our solution is a versatile, GIG centric architecture and modular architecture for sharing transliteration data and rules expressed in a simple yet power language. Transliteration definitions can be composed online in a "wizard" interface or with any text editor and uploaded to a community repository. The meta-language applies an inheritance model which allows departments to make their own local extensions to a transliteration system in the repository to serve a particular purpose without impacting other users. The architecture includes a library of APIs for the transliteration engine to work both online as a "web service" or offline as a stand alone application where transliteration definition files would be read from a local directory. Transliteration into multiple systems at once is possible as well an inter-script text matching service where script is first folded into the IPA system.

12636 Research Blvd #C214
Austin, TX 78759
(512) 250-1018
Dr. Ira Baxter
NGA 06-001      Awarded: 05JUN06
Title:Modularized Transliteration Engine based on DMS
Abstract:This SBIR project will define a means for specifying and executing multiple transliteration systems to map written words in scripts in a variety of natural languages into Latin scripts. The work will investigate the issues behind transliteration of place names. It will define a domain-specific language, TSL, in which a transliteration system can be coded in a format easily written, understood and maintainable by a transliteration expert, using the scripting systems of the source and target natural language. The goal for TSL is to be able to specify transliterations between many pairs of language, but emphasis will be place on transliterations to Latin scripts. The work will encode in TSL the 70 systems used in the National Geospatial-Intelligence Geographic Names Database. A compiler for TSL will be constructed to generate an extremely efficient implementation of a TSL instance. Compiled transliterators will be packaged for integration with systems such as NGA-GND, and will include heuristic spelling correctors based on supplied lists of potential target phrases. The transliteration compiler technology will be based on a mature, automated source-to-source transformation system, DMS, which has been used for many computer-language translation and analysis tasks over the past decade.