Kannada/ToDo

From Kannada l10n Wiki

Jump to: navigation, search

List of ideas for GSoC 2008 submission. Outcome of the projects are intended to be released as free software (GPLv2 or later).

ಪರಿವಿಡಿ

Unicode Standard Kannada Font

Design an Open Type Font for general use to be released in GPLv2 or later that includes Nuktha and old Kannada characters. The font can optionally replicate glyphs of any of the copyright expired traditional press font.

Significance:

Presently there is hardly any complete, bug-free Unicode (Open Type) Font for Kannada. Kannada users are still looking for a proper general use font. Publishing industry in Kannada still relies on TTF fonts with old proprietary encodings. Absence of a good Kannada Open Type font has been one of the factors. When compared with the traditional press, the choice of fonts is limited too.
Option of choosing a traditional press font could bring some of the fonts that are on their way to extinction to the digital world. Some wonderful Kannada fonts were used in the traditional press that are now widely forgotten, but not hard to find.

Resources

Books

  • Fonts and Encodings by Yannis Haralombous. (Few copies of this book are available with some members of Sampada community. Students who need a copy, drop a mail to mail@sampadafoundation.org to check if anybody near you has one).

Difficulty level: Hard.

Conversion tool to convert from popular proprietary encodings in Kannada to Unicode

A platform independent converter between Nudi-Unicode, Baraha (including Baraha 5.0 encoding) to Unicode, SRG to Unicode, ISCII to Unicode, ShreeLipi to Unicode with a batch process mode. Input can be from text files, rtfs.

Significance:

Old proprietary encodings are still widely used by Kannada users. Most people still stick to it due to the absence of a good conversion utility that would help them switch once for all.
Much of the data populated in last decade by Kannada users have been in the old formats. A good conversion tool would help them shift to Unicode completely. This would increase the number of users using Unicode Kannada to search, blog and converse in Kannada on the Internet.

Links

Difficulty level: Easy

Lexical database

As on date, there is no publicly available corpus of Kannada words. A lexical database for Kannada needs to be planned and compiled. The project can be built on the lines of Wordnet and tools available for Wordnet can be localized to support this project.

Difficulty level: Easy

Aspell Kannada

Develop an Aspell dictionary for Kannada.

Word lists available presently for use:

  • Gi Vem Dictionary - compiled by Professor G Venkatasubbaiah. (Permission status: taken; Digitized format available: yes)
  • Kannada Saahithya Parishath Dictionary (Permission status: pending; Digitized format available: partial)
  • Mysore University Dictionary (permission status: pending; Digitized format available: No)

Related Link:

Difficulty level: Intermediate

Input methods on GNU/Linux

  • KP Rao layout on XKB
  • Baraha phonetic on XKB
  • Typewriter layout on XKB.
  • Fix KP Rao Layout (also known as Nudi Layout) on SCIM.
  • Fix Inscript layout on SCIM.
  • Add Typewriter layout on SCIM.
  • KP Rao Layout, Baraha phonetic, Typewriter Layout for IIIMF.

Difficulty level: Easy

Resources:

Typing tutor

Platform independent, preferably. It can cover:

  • Inscript Layout.
  • KP Rao Layout.
  • Typewriter Layout.

Difficulty level: Easy

Links:

Rendering issues on GNU/Linux

Priority could be solve the rendering issues on GNOME, KDE and XFCE.

Difficulty level: Intermediate

Dhvani - Kannada

Contribute to the TTS Program - Dhvani and complete the implementation of the Kannada Module. Integrate dhvani engine with GNOME/KDE. Write a web service wrapper over the dhvani engine for using in Web applications.

Links: Dhvani project page

Difficulty level: Intermediate

Translation

Localization of Drupal, Wordpress, Joomla, Moodle, Mediawiki to Kannada. (Drupal, Mediawiki and Moodle has been done for previous versions). Review Kannada Locale on debian.

Difficulty level: Easy

Astronomy and education related tools in Kannada language

A good contribution to the kids starting their education in Kannada medium would be to provide the wonderful tools present on GNU/Linux in Kannada. The applications:

  • Celestia
  • KStars
  • openuniverse
  • ssystem

The tools can also serve greatly in encouraging uneducated in Karnataka to learn easily in their native language.

Difficulty level: Easy

Tools for Mobile devices

Presently there are no standard tools available on mobile phones for basic input in Kannada language. There are various tasks undone in this direction.

Difficulty level: Intermediate

OCR

Either implement the objectives of projects such as Ocrad or JOCR with Kannada as the target language. Or develop a proof of concept character recognition library that can understand atleast printed Kannada.

Also check:

Difficulty level: Hard

Web tools

News aggregation from newspaper websites that use old proprietary encodings

Presently, various major sources of news (newspaper websites) in Kannada (Udayavani, Prajavani, Kannada Prabha) use old proprietary encoding to display Kannada characters. The information retrieved by crawling these websites can be converted and put into an aggregation.

An added search functionality, feature to add various other news feeds would help Kannada users find news in Kannada about any topic anytime.

Difficulty level: Intermediate

Script for on-browser Input in Kannada for Open source Content Management Systems

A basic script in JS for on-browser input that can be easily adapted through modules for Drupal, Moodle, Joomla, Mediawiki and other widely used Open Source Content Management Systems. Or this could be developed as a plugin for the popular tinymce, fckeditor, htmlarea tools.

The script could support various popular layouts namely, Baraha Phonetic, Nudi (K P Rao), Typewriter and Inscript. The script could use JS accelerators to toggle between various layouts and English.

The script could be built over presently available phonetic input solutions that are in GPL compatible license and perfected for various popular layouts of Kannada.

Related Link:

Difficulty level: Easy

Speech Recognition

Development of a proof of concept library/tool to interprete Kannada spoken language.

Difficulty level: Hard

Accessibility

Developing and adapting the above Kannada TTS, STT, other tools to the GNOME/KDE Desktop Environments as per their respective Usability/Accessibility Guidelines.

Difficulty level: Intermediate

Localised Linux Distribution

Develop a localised Linux distribution based on Debian/Ubuntu packaged and targeted mainly towards Government/Non-profits/Home. The distribution can have the major software available in Kannada.

Difficulty level: Intermediate

How to Apply

  1. FAQ: http://code.google.com/soc/2008/faqs.html
  2. Student Application Template
Personal tools