Language:
Padavrtti

Padavrtti

word-frequency counter
Download Link

padāvr̥tti

Padāvr̥tti is a word frequency counter developed at LDC-IL. It generates a list of unique words along with their respective frequencies derived from the provided TXT and XML files.

It offers user to exclude numerals, Roman letters and punctuation marks from frequency list. The output will be generated as tab-separated txt file at the location of Padāvr̥tti . The output file contains the list of distinct tokens alongside their frequencies. The output filename shown as 'Padavrtti'-{distinct tokens processed}-{total tokens processed}.

Credits: Rajesha N, Linguistic Data Consortium for Indian Languages (LDC-IL), Central Institute of Indian Languages, Mysore.

Padavrtti Interface :