Paataantara

Levenshtein Distance Calculator

Pāṭhāntara



Pāṭhāntara is a web application from LDC-IL that calculates the edit distance between two given strings. It can be used to calculate the edit rate of a text against a modified version of that text.

Pāṭhāntara computes the Levenshtein distance using dynamic programming. It builds a matrix in which each cell holds the edit distance between a prefix of the first input and a prefix of the second. By filling the matrix iteratively, choosing at each cell the cheapest of the three edit operations (insertion, deletion, or substitution), the algorithm arrives at the minimum edit distance between the two inputs. To calculate the edit rate, it tokenizes the input strings at the word level.
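
The following Python sketch illustrates the general technique described above; it is not Pāṭhāntara's actual source code. The function names `levenshtein` and `word_edit_rate` are hypothetical. The sketch also assumes that tokens are separated by whitespace and that the edit rate is the word-level distance divided by the number of words in the original text; the application may tokenize or normalize differently.

```python
from typing import Sequence


def levenshtein(a: Sequence, b: Sequence) -> int:
    """Minimum number of insertions, deletions and substitutions turning a into b."""
    rows, cols = len(a) + 1, len(b) + 1
    # dist[i][j] holds the edit distance between a[:i] and b[:j].
    dist = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        dist[i][0] = i  # delete all i items of a[:i]
    for j in range(cols):
        dist[0][j] = j  # insert all j items of b[:j]
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j] + 1,         # deletion
                dist[i][j - 1] + 1,         # insertion
                dist[i - 1][j - 1] + cost,  # substitution (or match)
            )
    return dist[rows - 1][cols - 1]


def word_edit_rate(original: str, modified: str) -> float:
    """Word-level edit distance divided by the original's word count.

    Assumption: whitespace tokenization and normalization by the length
    of the original; an empty original is treated as having length 1
    to avoid division by zero.
    """
    original_words = original.split()
    modified_words = modified.split()
    return levenshtein(original_words, modified_words) / max(len(original_words), 1)
```

Because `levenshtein` accepts any sequence, the same function works on characters and on word lists. For example, `levenshtein("kitten", "sitting")` returns 3, and `word_edit_rate("the quick brown fox", "the quick red fox jumps")` returns 0.5 (one substitution and one insertion over four original words).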

Writers and editors can use Pāṭhāntara to assess the similarity between different versions of a text. It can also be used for quality control of spell-checking, OCR, and ASR systems by comparing their output with the post-corrected data.

Credits: Rajesha N, Linguistic Data Consortium for Indian Languages (LDC-IL), Central Institute of Indian Languages, Mysore.