e-magyar text processing system

Available presets:

Available tools (see documentation for more details on usage):

emToken
Tokenize with Stanza
UDPipe tokenizer
Tokenize, POS tag and lemmatize with Stanza as a whole
UDPipe tokenizer and POS tagger as a whole
Tokenize, POS tag, lemmatize and dep parse with Stanza as a whole
UDPipe tokenizer, POS tagger and dependency parser as a whole
emMorph
emTag (PurePOS)
emmorph2ud
emmorph2ud2
POS tag with Stanza (without lemmatisation)
POS tag and lemmatize with Stanza
UDPipe POS tagger
UDPipe POS tagger and dependency parser as a whole
emDep
emDep (limited to 50 token)
Dep parse with Stanza
UDPipe dependency parser
emCons
HunspellPy
emChunk
emNER
emBERT (baseNP)
emBERT (maxNP)
emBERT (NER)
IOB format converter and fixer for maxNP
IOB format converter and fixer for NER
Mark multiword terminology expressions from fixed list
Inserts zero pronouns (subjects, objects and possessors) into dependency parsed texts
emPhon phonetic transcriber with IPAization and with comment lines
emPhon phonetic transcriber with IPAization but without comment lines
emPhon phonetic transcriber without IPAization but with comment lines
emPhon phonetic transcriber without IPAization and comment lines
annotate compound boundaries
connect preverbs
EXAMPLE (The friendly name of DummyTagger used in REST API form)
A good-enough converter to GATE format for e-magyar.hu
CoNLL-U converter

Input text or file:

Output mode:

Result:

RAW WORD DEP CONS