2. Port4NooJ 3.1
LG of Support Verb fazer (2017)
LG of Human Intransitive Adjectives (2015)
LG of Support Verb ser de (2018)
eSPERTo Smart Paraphrasing System
Port4NooJ
Genesis 2009: OpenLogos
✓ Bilingual PT-EN
✓ Morpho-syntactic Relations
✓ Semantico-syntactic Properties (SAL)
✓ Derivational Relations
✓ Support Verb Constructions
✓ Semantic Relations
✓ SentiLex 2016
✓ Stencil NER 2016
8. Transformations based on noun predicates with Vsup ser de
Negation Paraphrases
[in-N] O Pedro é de uma certa intolerância à lactose
Pedro is of a certain intolerance to lactose
[falta de N] ([lack of N]) O Pedro é de uma certa falta de tolerância à lactose
Pedro is of a certain lack of tolerance to lactose
Vsup Substitution Paraphrases
[Vsup=ser de] A Ana é de uma alegria contagiante
Ana is of a contagious happiness
[Vsup=ter] A Ana tem uma alegria contagiante
Ana has a contagious happiness
[Vsup=haver] Há na Ana uma alegria contagiante
There is a contagious happiness in Ana
[Vsup=ser de] A Ana é de uma alergia ao pó impressionante
Ana is of an impressive allergy to dust
[Vsup=fazer] A Ana faz uma alergia ao pó impressionante
Ana makes an impressive allergy to dust
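The Vsup substitution paraphrases above can be pictured as filling one surface template per licensed support verb. The sketch below is only an illustration in Python: the `TEMPLATES` table and the `vsup_paraphrases` function are hypothetical names, the subject and locative forms are supplied by hand, and the real system uses NooJ grammars rather than string templates.

```python
# Hypothetical surface templates keyed by support verb (not the eSPERTo grammars).
TEMPLATES = {
    "ser de": "{subject} é de {np}",
    "ter": "{subject} tem {np}",
    "haver": "Há {locative} {np}",
}


def vsup_paraphrases(subject, locative, np, licensed):
    """Return one sentence per support verb in `licensed`, in template order."""
    return [
        template.format(subject=subject, locative=locative, np=np)
        for vsup, template in TEMPLATES.items()
        if vsup in licensed
    ]


for sentence in vsup_paraphrases(
    "A Ana", "na Ana", "uma alegria contagiante", {"ser de", "ter", "haver"}
):
    print(sentence)
```

The set passed as `licensed` stands in for the support verb properties of the noun; a noun that does not license haver would simply omit that key.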
9. Integration of LG of Vsup ser de: Major challenges
• 50% of the predicate nouns already exist in Port4NooJ
– Old news: partly addressed since we started integrating the LG with Vsup fazer
  - consolidate information from old entry and LG table
  - solution is far from perfect
  - needs thorough revision
• In 55% of the cases where the predicate noun has an equivalent adjectival construction, the adjective is a homograph of a human intransitive adjective (HIA) already formalized in the LG of HIA
– New problem! Adjectives equivalent to predicate nouns are being treated by derivation. It is not yet clear how to harmonize those derived entries with the HIA entries.
12. From LG tables to NooJ dictionaries
• Representation of LG table properties
• The DRV code is determined and formalized automatically by finding the radical shared by the noun and the related verb or adjective, which are listed in a separate file
activ(idade) => N2A5= <B5>o/A
• FLX code of derived word is determined by consulting Port4NooJ
activo,A+FLX=ALTO+AV+state+EN=brisk+DRV=AVDRV01:RAPIDAMENTE
If the derived form does not exist, then its code is assigned automatically
+DRV=N2A5:ALTO
+Prep… +N1…
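As a rough illustration of how the derivation operator can be read off a noun and its derived form, the Python sketch below takes the longest shared prefix as the radical, counts the characters to delete from the noun, and appends the remaining ending. The function name `derivation_operator` is hypothetical, and the numbering of paradigms such as N2A5 is not modelled here.

```python
import os


def derivation_operator(noun, derived, category="A"):
    """Build an operator such as '<B5>o/A' from a noun and its derived form.

    Assumption: the radical is the longest common prefix of the two words.
    """
    radical = os.path.commonprefix([noun, derived])
    deletions = len(noun) - len(radical)   # characters to strip from the noun
    ending = derived[len(radical):]        # characters to append afterwards
    backspace = f"<B{deletions}>" if deletions else ""
    return f"{backspace}{ending}/{category}"


print(derivation_operator("actividade", "activo"))  # <B5>o/A
```

For actividade and activo the shared radical is activ, so five characters are deleted and o is appended, which matches the operator shown for activ(idade).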
13. From LG tables to NooJ dictionaries
• Integration with eSPERTo dictionary entries
① Noun not in Port4NooJ (old or current):
✓ Create new entry:
✓ FLX code is assigned automatically given the ending of the word
✓ Entries are checked for missing FLX codes and reviewed by a linguist
✓ All other properties come from LG table
✓ Add entry to new standalone dictionary npred_vsupserde.dic
airosidade,N+FLX=CASA+Npred+Vsup=ser+Table=SdH1+N0Nhum+N0Npabst+N0Nclasspessoa+DetEModif+DetUMModif+Vsupter=ter+VsupteroNdeVinf0w=ter+DRV=N2A5:ALTO
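A minimal Python sketch of case ① might look as follows. The ending-to-FLX table is an assumption read off the example entries in these slides only (airosidade/CASA, aprumo/ANO, avidez/LUZ); the actual table used by the authors is not shown, and the names `FLX_BY_ENDING`, `guess_flx` and `new_entry` are hypothetical.

```python
# Illustrative ending -> FLX table, inferred from the slide examples only.
FLX_BY_ENDING = {"dade": "CASA", "ez": "LUZ", "o": "ANO"}


def guess_flx(noun):
    """Return the FLX code of the longest matching ending, or None."""
    for ending in sorted(FLX_BY_ENDING, key=len, reverse=True):
        if noun.endswith(ending):
            return FLX_BY_ENDING[ending]
    return None


def new_entry(noun, lg_properties):
    """Build a dictionary line and a flag saying whether FLX is missing."""
    flx = guess_flx(noun)
    head = f"{noun},N" + (f"+FLX={flx}" if flx else "")
    line = head + "".join(f"+{prop}" for prop in lg_properties)
    return line, flx is None  # True means: send to the linguist for review


line, needs_review = new_entry("airosidade", ["Npred", "Vsup=ser", "Table=SdH1"])
print(line, needs_review)
```

The second return value models the check for missing FLX codes: entries flagged `True` would be the ones handed to a linguist.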
14. From LG tables to NooJ dictionaries
• Integration with eSPERTo dictionary entries
② Noun exists both in current and old Port4NooJ
A. If entries are the same do Merge 1:
✓ Blindly add additional properties as specified by the LG tables to current entries
✓ Add merged entries to npred_vsupserde.dic
Current entry:
aprumo,N+FLX=ANO+AB+prop+EN=aplomb
Properties added from the LG table:
+Npred+Vsup=ser+Table=SdH1+Negfaltade+N0Nhum+N0Npc+N0Npabst+N0Nclasspessoa+DetEModif+DetUMModif+Vsupter=ter+VsupteroNdeVinf0w=ter+Vsuphaver=haver+DRV=N2A16:ALTO
15. From LG tables to NooJ dictionaries
② A. Merge 1, continued: merged entry
aprumo,N+FLX=ANO+AB+prop+EN=aplomb+Npred+Vsup=ser+Table=SdH1+Negfaltade+N0Nhum+N0Npc+N0Npabst+N0Nclasspessoa+DetEModif+DetUMModif+Vsupter=ter+VsupteroNdeVinf0w=ter+Vsuphaver=haver+DRV=N2A16:ALTO
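Merge 1 amounts to appending the LG properties to the current entry unchanged. The Python sketch below shows that step on the aprumo example; `merge1` is a hypothetical name, and nothing here checks for duplicate or conflicting properties, in keeping with the "blindly add" wording.

```python
def merge1(current_entry, lg_properties):
    """Append every LG property to the current entry, without any checks."""
    return current_entry + "".join(f"+{prop}" for prop in lg_properties)


lg_properties = (
    "Npred+Vsup=ser+Table=SdH1+Negfaltade+N0Nhum+N0Npc+N0Npabst"
    "+N0Nclasspessoa+DetEModif+DetUMModif+Vsupter=ter"
    "+VsupteroNdeVinf0w=ter+Vsuphaver=haver+DRV=N2A16:ALTO"
).split("+")

print(merge1("aprumo,N+FLX=ANO+AB+prop+EN=aplomb", lg_properties))
```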
16. From LG tables to NooJ dictionaries
• Integration with eSPERTo dictionary entries
② Noun exists both in current and old Port4NooJ
B. If entries are not the same, do Merge 2 with old entries as shown in case 3:
✓ Remove previous Npred related properties
✓ Blindly add additional properties as specified by the LG tables to old entries
✓ Add merged entries to npred_vsupserde.dic
✓ Remove nominalization from CV
Entries in CV (current version):
avidez,N+FLX=LUZ+AB+qual+EN=avidity
avidez,N+FLX=LUZ+AB+qual+EN=greed
Entries in OV (old version):
avidez,N+FLX=LUZ+AB+strvb+Npred+Nom+EN=acquisitiveness+VRB=ansiar
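The choice between the integration cases can be sketched as a small dispatcher in Python. Here `cv` and `ov` are hypothetical mappings from a noun to its list of dictionary lines in the current and old versions, and "the same" is taken to mean identical sets of lines, which is an assumption. A noun found only in the current version is not covered by the cases on these slides, so the sketch reports it separately.

```python
def integration_case(noun, cv, ov):
    """Pick the integration case for a predicate noun from the LG table."""
    in_cv, in_ov = noun in cv, noun in ov
    if not in_cv and not in_ov:
        return "1: create new entry"
    if in_cv and in_ov:
        same = set(cv[noun]) == set(ov[noun])  # assumed notion of "the same"
        return "2A: Merge 1" if same else "2B: Merge 2"
    if in_ov:
        return "3: Merge 2"
    return "not covered: noun only in current version"


cv = {"avidez": ["avidez,N+FLX=LUZ+AB+qual+EN=avidity",
                 "avidez,N+FLX=LUZ+AB+qual+EN=greed"]}
ov = {"avidez": ["avidez,N+FLX=LUZ+AB+strvb+Npred+Nom+EN=acquisitiveness+VRB=ansiar"],
      "capricho": ["capricho,N+FLX=ANO+AB+strvb+Npred+Nom+EN=caprice+VRB=caprichar"]}

for noun in ("airosidade", "avidez", "capricho"):
    print(noun, "->", integration_case(noun, cv, ov))
```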
17. From LG tables to NooJ dictionaries
② B. Merge 2, continued: properties from the LG table to be added to the OV entry
Entry in OV:
avidez,N+FLX=LUZ+AB+strvb+Npred+Nom+EN=acquisitiveness+VRB=ansiar
Properties added from the LG table:
+Npred+Vsup=ser+Table=SdQ0+N0Nhum+N0NpreddeN+N0NopQueF+N0RestrNopQueF+N0QueFconj+N0OfactodeVinf0w+N0N0Vinf0w+N0RestrVinf0w+N0Nclass+N0Nclasspessoa+DetEModif+DetUMModif+Vsupter=ter+Vsuphaver=haver+DRV=N2A18:ALTO
18. From LG tables to NooJ dictionaries
② B. Merge 2, continued: merged entry
avidez,N+FLX=LUZ+AB+strvb+Npred+Nom+EN=acquisitiveness+VRB=ansiar+Npred+Vsup=ser+Table=SdQ0+N0Nhum+N0NpreddeN+N0NopQueF+N0RestrNopQueF+N0QueFconj+N0OfactodeVinf0w+N0N0Vinf0w+N0RestrVinf0w+N0Nclass+N0Nclasspessoa+DetEModif+DetUMModif+Vsupter=ter+Vsuphaver=haver+DRV=N2A18:ALTO
19. From LG tables to NooJ dictionaries
• Integration with eSPERTo dictionary entries
③ Noun exists only in old Port4NooJ
✓ Do Merge 2 with old entries as shown in Case 2-B:
✓ Remove previous Npred related properties
✓ Blindly add additional properties as specified by the LG tables to old entries
✓ Add merged entries to npred_vsupserde.dic
✓ Remove nominalization from CV
Old entry:
capricho,N+FLX=ANO+AB+strvb+Npred+Nom+EN=caprice+VRB=caprichar
20. From LG tables to NooJ dictionaries
③ Merge 2, continued: old entry after removing the previous Npred related properties, followed by the properties added from the LG table
capricho,N+FLX=ANO+AB+strvb+EN=caprice
+Npred+Vsup=ser+Table=SdH1+N0Nhum+N0Npabst+N0Nclasspessoa+DetE+Vsupter=ter+VsupserumNdpdNhum=ser+Vsuphaver=haver+DRV=N2A25:ALTO
21. From LG tables to NooJ dictionaries
③ Merge 2, continued: merged entry
capricho,N+FLX=ANO+AB+strvb+EN=caprice+Npred+Vsup=ser+Table=SdH1+N0Nhum+N0Npabst+N0Nclasspessoa+DetE+Vsupter=ter+VsupserumNdpdNhum=ser+Vsuphaver=haver+DRV=N2A25:ALTO
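Merge 2 can be sketched in Python as a filter followed by an append. Which properties count as "previous Npred related" is an assumption here: the capricho example drops Npred, Nom and VRB=..., so the hypothetical `merge2` below removes exactly those. The avidez example appears to keep them, so this filter should be treated as illustrative only.

```python
def is_old_npred_property(prop):
    """Assumed filter: the properties the capricho example drops."""
    return prop in ("Npred", "Nom") or prop.startswith("VRB=")


def merge2(old_entry, lg_properties):
    """Strip old Npred related properties, then append the LG properties."""
    head, *props = old_entry.split("+")
    kept = [prop for prop in props if not is_old_npred_property(prop)]
    return "+".join([head, *kept, *lg_properties])


lg_properties = (
    "Npred+Vsup=ser+Table=SdH1+N0Nhum+N0Npabst+N0Nclasspessoa+DetE"
    "+Vsupter=ter+VsupserumNdpdNhum=ser+Vsuphaver=haver+DRV=N2A25:ALTO"
).split("+")

old = "capricho,N+FLX=ANO+AB+strvb+Npred+Nom+EN=caprice+VRB=caprichar"
print(merge2(old, lg_properties))
```

On the capricho entry this reproduces the merged line shown on the slide.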
25. Preliminary Results
• 2132 predicate nouns with Vsup ser de (1376 different noun lemmas)
– An additional 797 entries cannot yet be added to the final dictionary: they await revision of the inflectional codes of their derived adjectives or have format problems
• 450 new derivational paradigms, although some may overlap with paradigms created when integrating the LG of Vsup fazer
• Example grammars for the syntactic parser
• Half of the nouns already existed in Port4NooJ (50%)
→ 6% increase in nominal entries and 20% increase in predicate nouns
Table | Example | In Port4NooJ | New | % in Port4NooJ
SdH1 | O Zé é de uma alegria contagiante | 183 | 208 | 47%
SdH2 | O Zé é da confiança da Ana | 41 | 14 | 75%
SdNH1 | Este molho é de uma acidez exagerada | 153 | 211 | 42%
SdNH2 | Esta substância é de uma total indissolubilidade em água | 16 | 15 | 52%
SdNPC | O rosto da Ana era de uma palidez doentia | 7 | 24 | 23%
SdQ0 | Essa medida é de grande abrangência | 309 | 512 | 38%
SdQ1 | O Zé foi de uma agressividade desproporcionada para com a Ana | 162 | 147 | 52%
SdQ2 | O Zé é de uma grande habilidade para tratar das roseiras | 22 | 16 | 58%
SdSIM | O Zé e a Ana são de um companheirismo exemplar | 34 | 22 | 61%
Total | | 927 | 1169 | 50%
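The percentage column can be recomputed from the two count columns as in-Port4NooJ / (in-Port4NooJ + new). The short Python check below does this with the counts copied from the table. It also shows that the 50% in the Total row matches the unweighted mean of the nine per-table percentages, whereas the pooled ratio 927 / (927 + 1169) is about 44%; which of the two the authors intended is not stated.

```python
# (in Port4NooJ, new) counts per LG table, copied from the results table.
COUNTS = {
    "SdH1": (183, 208), "SdH2": (41, 14), "SdNH1": (153, 211),
    "SdNH2": (16, 15), "SdNPC": (7, 24), "SdQ0": (309, 512),
    "SdQ1": (162, 147), "SdQ2": (22, 16), "SdSIM": (34, 22),
}


def share(existing, new):
    """Percentage of nouns that were already in Port4NooJ."""
    return 100 * existing / (existing + new)


per_table = {table: share(*pair) for table, pair in COUNTS.items()}
total_existing = sum(existing for existing, _ in COUNTS.values())
total_new = sum(new for _, new in COUNTS.values())
pooled = share(total_existing, total_new)
mean_of_tables = sum(per_table.values()) / len(per_table)

for table, pct in per_table.items():
    print(f"{table}: {pct:.0f}%")
print(f"totals: {total_existing} existing, {total_new} new")
print(f"pooled: {pooled:.1f}%  mean of tables: {mean_of_tables:.1f}%")
```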