Multilingual
Lexicographical Data
Explore Wikidata's lexemes: words, grammatical forms, and senses. Query linguistic data for language learning and research.
Lexeme Structure
Wikidata lexemes represent dictionary entries with three components:
| Component | Description | Example |
|---|---|---|
| Lexeme | The dictionary entry (word) | L1234 "run" (English verb) |
| Form | Grammatical variations | "run", "runs", "ran", "running" |
| Sense | Different meanings | "to move quickly", "to operate" |
Basic Lexeme Queries
Find English Nouns
SELECT ?lexeme ?lemma
WHERE {
?lexeme dct:language wd:Q1860 ; # English
wikibase:lexicalCategory wd:Q1084 ; # noun
wikibase:lemma ?lemma .
}
LIMIT 50
Lexemes with Their Forms
SELECT ?lexeme ?lemma ?form ?formRep
WHERE {
?lexeme dct:language wd:Q1860 ;
wikibase:lexicalCategory wd:Q24905 ; # verb
wikibase:lemma ?lemma ;
ontolex:lexicalForm ?form .
?form ontolex:representation ?formRep .
FILTER(STR(?lemma) = "run")
}
Programming Terminology
Programming Terms as Lexemes
SELECT ?lexeme ?lemma ?lang ?langLabel
WHERE {
# Find lexemes whose sense links to programming concepts
?lexeme ontolex:sense ?sense .
?sense wdt:P5137 ?concept . # item for this sense
?concept wdt:P31/wdt:P279* wd:Q17155032 . # programming term
?lexeme wikibase:lemma ?lemma ;
dct:language ?lang .
SERVICE wikibase:label {
bd:serviceParam wikibase:language "en" .
}
}
LIMIT 50
Cross-Language Word Comparisons
Same Concept in Different Languages
SELECT ?concept ?conceptLabel
?enLemma ?frLemma ?deLemma
WHERE {
VALUES ?concept { wd:Q7239 } # organism
OPTIONAL {
?enLex ontolex:sense/wdt:P5137 ?concept ;
dct:language wd:Q1860 ; # English
wikibase:lemma ?enLemma .
}
OPTIONAL {
?frLex ontolex:sense/wdt:P5137 ?concept ;
dct:language wd:Q150 ; # French
wikibase:lemma ?frLemma .
}
OPTIONAL {
?deLex ontolex:sense/wdt:P5137 ?concept ;
dct:language wd:Q188 ; # German
wikibase:lemma ?deLemma .
}
SERVICE wikibase:label {
bd:serviceParam wikibase:language "en" .
}
}
Key Lexeme Properties
| Property | Description |
|---|---|
wikibase:lemma |
The canonical form of the word |
dct:language |
Language of the lexeme (Q-item) |
wikibase:lexicalCategory |
Part of speech (noun, verb, etc.) |
ontolex:lexicalForm |
Grammatical forms |
ontolex:sense |
Meanings/definitions |
wdt:P5137 |
Item for this sense (links to Q-items) |