Available presets:
None
Full pipeline
Raw text to morphological analysis
Raw text to POS-tagging in emMorph formalism
Raw text to maximal NP chunking
Raw text to named-entity annotation
Raw text to POS-tagging including UDv1 form
Raw text to dependency parsing
Raw text to dependency parsing using Stanza Dependency parser
Raw text to dependency parsing in CoNLL-U format
Raw text to dependency parsing in CoNLL-U format using Stanza Dependency parser
Raw text to constituent parsing
Raw text to emBERT named-entity annotation
Raw text to emBERT maximal NP chunking
Available tools (see documentation for more details on usage):
emToken
Tokenize with Stanza
UDPipe tokenizer
Tokenize, POS tag and lemmatize with Stanza as a whole
UDPipe tokenizer and POS tagger as a whole
Tokenize, POS tag, lemmatize and dep parse with Stanza as a whole
UDPipe tokenizer, POS tagger and dependency parser as a whole
emMorph
emTag (PurePOS)
emmorph2ud
emmorph2ud2
POS tag with Stanza (without lemmatization)
POS tag and lemmatize with Stanza
UDPipe POS tagger
UDPipe POS tagger and dependency parser as a whole
emDep
emDep (limited to 50 tokens)
Dep parse with Stanza
UDPipe dependency parser
emCons
HunspellPy
emChunk
emNER
emBERT (baseNP)
emBERT (maxNP)
emBERT (NER)
IOB format converter and fixer for maxNP
IOB format converter and fixer for NER
Mark multiword terminology expressions from a fixed list
Insert zero pronouns (subjects, objects and possessors) into dependency-parsed texts
emPhon phonetic transcriber with IPAization and with comment lines
emPhon phonetic transcriber with IPAization but without comment lines
emPhon phonetic transcriber without IPAization but with comment lines
emPhon phonetic transcriber without IPAization and without comment lines
Annotate compound boundaries
Connect preverbs
EXAMPLE (The friendly name of DummyTagger used in REST API form)
A good-enough converter to GATE format for e-magyar.hu
CoNLL-U converter
Input text or file:
Output mode:
Display
Download
Result:
RAW
WORD
DEP
CONS