forked from Wikidepia/g2p-id

Indonesian Grapheme-to-Phoneme (IPA notation)

fdvky1/g2p-id

Indonesian Grapheme-to-Phoneme

This module is designed to convert Indonesian graphemes (spelling) into phonemes (pronunciation). Fortunately, most Indonesian word pronunciations can be inferred from their spelling.
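To illustrate why spelling is usually enough, here is a minimal sketch of a rule-based mapping for a few well-known Indonesian spelling conventions (for example, "ng" is pronounced /ŋ/ and "c" is pronounced /tʃ/). The function name `toy_g2p` and both tables are hypothetical and are not part of this module; the module itself uses a learned predictor, and this sketch ignores stress, the ambiguous letter "e", and morpheme boundaries.

```python
# Toy illustration only: NOT the rule set or model used by g2p-id.
# The tables below cover a handful of common Indonesian spelling conventions.
DIGRAPHS = {"ng": "ŋ", "ny": "ɲ", "sy": "ʃ", "kh": "x"}
SINGLES = {"c": "tʃ", "j": "dʒ", "y": "j", "g": "ɡ"}


def toy_g2p(word: str) -> str:
    """Map one word to a rough phoneme string, trying digraphs first."""
    word = word.lower()
    out = []
    i = 0
    while i < len(word):
        pair = word[i:i + 2]
        if pair in DIGRAPHS:
            # Two letters that stand for a single sound, e.g. "ng" -> ŋ.
            out.append(DIGRAPHS[pair])
            i += 2
        else:
            # Letters without a special rule are passed through unchanged.
            ch = word[i]
            out.append(SINGLES.get(ch, ch))
            i += 1
    return "".join(out)


print(toy_g2p("nyanyi"))  # ɲaɲi
print(toy_g2p("bunga"))   # buŋa
print(toy_g2p("cuci"))    # tʃutʃi
```

The letter "e" is deliberately left untouched here: it can stand for more than one vowel, which simple spelling rules cannot resolve.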

Big thanks to Wilson Wongso for sharing Predicting Phonemes with BERT. The predictor in this module is implemented with his code.

Installation

pip install git+https://github.com/Wikidepia/g2p-id

Example usage

from g2p_id import G2P

g2p = G2P()
g2p("Rumah Agus terbakar.") # ˈrumah ˈaɡʊs tərˈbakar.
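For more than one sentence, a small wrapper can apply the converter line by line. The sketch below assumes only what the example above shows, namely that a `G2P()` instance can be called with a string and returns a string. The helper name `transcribe_all` is hypothetical and not part of this module.

```python
from typing import Callable, Iterable, List


def transcribe_all(g2p: Callable[[str], str], sentences: Iterable[str]) -> List[str]:
    """Apply a grapheme-to-phoneme callable to each non-empty sentence.

    `g2p` can be any callable that maps a string to a string. A `G2P()`
    instance is assumed to behave this way, based on the usage example.
    """
    results = []
    for sentence in sentences:
        sentence = sentence.strip()
        if sentence:  # skip blank lines
            results.append(g2p(sentence))
    return results


# Hypothetical usage with this module (requires g2p_id to be installed):
#   from g2p_id import G2P
#   phonemes = transcribe_all(G2P(), ["Rumah Agus terbakar.", "Saya makan nasi."])
```

Passing the converter in as an argument means the model is built once and reused, and the helper can be tried out with any stand-in function.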

References

TODO

  • Add test cases
  • Better model for predicting the pronunciation of "e" (for example, /e/ vs. /ə/)
  • Handle heteronyms
