For full functionality of Sketch Engine it is necessary to
enable JavaScript
zannoni
coliweb
wikiita
wikieng
wikiesp
wikipor
wikiger
wikifra
imag
paisa
imag_w
paisa_w
CoLIWeb
defaults
Reset settings
English
česky
slovensky
简体中文
繁體中文
Gaeilge
slovenščina
hrvatski
العربية
español
français
українська
polski
Home
Search
Word list
Corpus info
My jobs
User guide
All words
All lemmas
Find x
Menu position
This action may take several minutes for large corpora, please wait.
Word list options
Corpus:
CoLIWeb
Imagact spoken
Imagact spoken W
Paisa 1.6
Paisa 1.6 W
WikiHow - English
WikiHow - Spanish
WikiHow - French
WikiHow - German
WikiHow - Italian
WikiHow - Portuguese
Corpus Zannoni
Subcorpus:
create new
Search attribute:
word
tag
lemma
doc.sito
doc.url
doc.categoria
doc.produzione
doc.wordcount
use n-grams
. Value of n: from
2
3
4
5
6
to
2
3
4
5
6
hide/nest sub-n-grams
Filter options:
Filter word list by:
Regular expression:
Minimum frequency:
Maximum frequency:
(0 = no maximum frequency)
Whitelist:
Blacklist:
format
Word list whitelists and blacklists must be plain text (.txt), encoded in UTF-8, with one item per line. The items must correspond to the selected attribute, so, eg, if 'lemma' is selected from the attribute menu, then the list should be a list of lemmas. We use exact matching, not regular-expression matching, for file input.
Include non-words
Output options:
Frequency figures:
Hit counts
Document counts
ARF
Output type:
Simple
Keywords
Reference (sub)corpus
CoLIWeb
Imagact spoken
Imagact spoken W
Paisa 1.6
Paisa 1.6 W
WikiHow - English
WikiHow - Spanish
WikiHow - French
WikiHow - German
WikiHow - Italian
WikiHow - Portuguese
Corpus Zannoni
(whole corpus)
Prefer:
rare words
common words
Change output attribute(s)
---
word
tag
lemma
---
word
tag
lemma
---
word
tag
lemma
You can select one or more output attributes. Please note that this option can be time-consuming.