Anusaaraka maps constructions in the source language to the
corresponding constructions in the target language wherever possible.
For example, a noun or pronoun in the source language is mapped to an
approriate noun or pronoun, respectively, in the target language below:
T: mIru pustakaM caduvutunnArA?
@H: Apa pustaka paDha_raHA_[HE|thA]_kyA{23_ba.}?
!E: You book read_ing_[is|was] Q.?
E: Are/were you reading a book?
(Where the prefixes mean the following:
T=Telugu, @H=anusaaraka Hindi, !E=English gloss, E=English.)
In the example above, the last word in the sentence is a verb and
illustrates the mapping from Telugu to Hindi morpheme by morpheme: the
root is mapped to 'paDha' (read), and similarly the tense-aspect-modality
(TAM) lable is mapped to 'raHA_[HE|thA]' (is_*ing or was_*ing), which
is followed by 'A' sufix which gets mapped to 'kyA' (what) as a question
marker in Hindi. Telugu leaves the tense open as present or past, which
is reflected in the output. Gender, number, and person (GNP) information is
is also shown separately in curly brackets ('{23_ba.}' for second or
third person and bahu-vachana (plural)).
Sometimes, for a construction in the source language, the same construction
is not available in the target language. In such a case, the system
choses another construction in the target language in which the same
information can be expressed. In the example below, the system choses
the complementizer construction in Hindi (EsA) to express the same
sense:
T: mA ammAyiki uxyogaM ceVyyAlani lexu.
@H: hamArA_ ladakI_ko` nOkarI karanA_EsA nahIM_[hE|WA].
!E: Our daughter(dat.) job do_should_that not(fem.)
E: It is not the case that our daughter should get a job.
However, the anusaaraka shows the image and therefore, it uses the complementizer (EsA).
Sometimes there are slight differences between a construction in the
source language to a similar construction in the target language
because of which information might not be preserved. In
such a situation additional notation is introduced to express the
information which would otherwise get lost. A simple example of this
is the lack of distinction between personal pronoun and pronominal
adjective in Hindi: vaha. Telugu on the other hand has two different
words for these: vADu and A.
T: vADu shkUluku vellADu.
@H: vaha` pAThshAlA_ko` gayA.
!E: he school(dat.) went.
E: He went to school.
T: A pAThaSAlaku TrophI vacciMdi.
@H: vaha- pAThshAlA_ko` TrophI AyI.
!E: that school(dat.) trophy came
E: That school received the trophy.
When transferring from Telugu to Hindi, this distinction would have
disappeared, if care was not taken. In anusaaraka, the two forms are
made different by introducing additional notation:
vaha` (he)
vaha- (that)
A more complex example involving both the use of additional
notation and mapping to a different construction is the participial
construction in South Indian languages. This is an example of building
"language-bridges". In South Indian languages, there are a rich set of
participial verbs. Hindi has basically only two. Therefore, to map
all the participial verbs from these languages, the relative clause
construction in Hindi is used.
T: pani cesina rAmmUrti maMcivADu.
@H: kAma kiyA_HE_jo*_vaHa_ rAmmUrti_[-] bhalA_AdamI[acchA_{pu.}].
!E: work *who_has_done Ramamurti good_man
E: Ramamurti who has done the word is a good man.
It should be noted above that the participial construction in Telugu
('cesina' as a modifier of the following noun 'rAmmUrti') is mapped
to a relative clause construction in Hindi. However, there is some
information which is not present in the original Telugu construction,
but is required in the relative clause construction in Hindi (marked by
'*' above). This can be supplied either by
the user or by the second module based on a suitable knowledge base.
In this case, 'ne' should be put in Hindi for the doer of the work;
in general, it could be 'ko' for the karma, 'se' for the instrument
etc.
However, it should be rememberd that the second module would be
fragile and will fail at times. In those situations, the anusaaraka
output can be read by a reader to get the correct meaning.
There can be other failures from unexpected quarters. Here is an example:
T: UriniMci oVccina nalini mA ceVlleVlu.
@H: gAzva_se` AyA_hE_jo*_vaha_ bImArI_ko[hUz] hamArA_ CotI_bahana.
^0
!E: from_village *who_has_come illness_acc. my younger_sister.
^one_
E: Nalini, who has come from the village is my younger sister.
Here, 'nalini' a proper noun got translated to 'illness'. This failure
can cause difficulty for the second module if it tries to arrive
at a suitable replacement for '*', and the reader can fall back on the
core anusaaraka to infer the meaning.