1. Jurilinguistic Engineering in Cantonese Chinese: An N-gram-based Speech to Text Transcription System. 2000 - B K T'sou, K K Sin, Samuel W. K. Chan, Tom Bong-yeung Lai, Caesar Suen Lun, K T Ko, Gil Chang Kim, Lawrence Y. L. Cheung. bigram and trigram statistical data derived from domain-specific training. In Stage 3, manual editing of the transcribed...
2. Memory-Efficient Katakana Compound Segmentation using Conditional Random Fields. dictionary information, which gives it additional domain specific training data. The influence of dictionary data (D-feature)...
3. Reordering Constraints for Phrase-Based Statistical Machine Translation. in the domain of hotel reservation. Here, we use domain-specific training data in addition to the BTEC corpus. The corpus...