Development of the Text Corpus

Whole corpus

[Figure: Whole corpus, number of tokens 2022–2025. 2022 (v17): 1.45M; 2023 (v18): 1.49M; 2024 (v19): 1.58M; 2025 (v20): 1.69M. Increase 2022–2025: +16%.]

Verified

| | 2022 (v17) | 2023 (v18) | 2024 (v19) | 2025 (v20) |
|---|---|---|---|---|
| Number of tokens | 1454723 | 1491370 | 1577350 | 1687196 |

[Figure: Token Distribution: With vs Without Hieroglyphs, 2025 (v20). Categories: Scientific Texts, Funerary Texts, Literary Texts, Historical Texts, Various Projects, Misc. Periods, Tomb Inscriptions, Religious Texts, Temple Inscriptions, Administrative, Museum Texts, Rock Inscriptions. Series: tokens with hieroglyphs, tokens without hieroglyphs.]
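
The +16% figure quoted for 2022–2025 can be checked directly from the verified token counts. The following is a minimal sketch of that arithmetic; the helper name `percent_change` and the dictionary layout are illustrative assumptions, not part of the corpus tooling.

```python
# Verified token counts per corpus release, copied from the figures above.
tokens = {
    "2022 (v17)": 1_454_723,
    "2023 (v18)": 1_491_370,
    "2024 (v19)": 1_577_350,
    "2025 (v20)": 1_687_196,
}


def percent_change(old: int, new: int) -> float:
    """Relative change from old to new, in percent (hypothetical helper)."""
    return (new - old) / old * 100


releases = list(tokens)

# Growth between each pair of consecutive releases.
for prev, curr in zip(releases, releases[1:]):
    change = percent_change(tokens[prev], tokens[curr])
    print(f"{prev} -> {curr}: {change:+.1f}%")

# Overall growth from the first to the last release.
overall = percent_change(tokens[releases[0]], tokens[releases[-1]])
print(f"overall: {overall:+.0f}%")  # prints "overall: +16%"
```

The overall value is computed against the 2022 count as the baseline, which is the reading that reproduces the rounded +16%.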

Hieroglyphic/hieratic

[Figure: Hieroglyphic/hieratic, categories 2022–2025. Values for 2025: Text 21279; Subtext 7989; Object 8472; Object part 5259; Others 3002.]

Verified

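| Category | 2022 (v17) | 2023 (v18) | 2024 (v19) | 2025 (v20) |
|---|---|---|---|---|
| Text | 18747 | 19271 | 20221 | 21279 |
| (Undefined) | 2 | 4 | 6 | 6 |
| Subtext | 7321 | 7450 | 7485 | 7989 |
| Object | 7241 | 7593 | 8086 | 8472 |
| (Undefined) | | | 1 | |
| Object part | 4596 | 4781 | 4929 | 5259 |
| Arrangement | 384 | 387 | 390 | 384 |
| Group | 549 | 561 | 606 | 640 |
| Scene | 1239 | 1240 | 1241 | 1240 |
| Collective caption | 628 | 640 | 683 | 738 |
| Total | 40707 | 41927 | 43648 | 46007 |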
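
The unlabeled last row appears to be the column total of the verified categories. As a minimal sketch under that assumption, the code below sums the eight named categories per year and subtracts the result from the stated totals; the remainder is what the two "(Undefined)" rows would have to contribute together. The variable names are illustrative only.

```python
# Verified counts per named category for 2022, 2023, 2024, 2025 (from the table above).
verified = {
    "Text":               [18747, 19271, 20221, 21279],
    "Subtext":            [7321, 7450, 7485, 7989],
    "Object":             [7241, 7593, 8086, 8472],
    "Object part":        [4596, 4781, 4929, 5259],
    "Arrangement":        [384, 387, 390, 384],
    "Group":              [549, 561, 606, 640],
    "Scene":              [1239, 1240, 1241, 1240],
    "Collective caption": [628, 640, 683, 738],
}

# Last row of the table, assumed here to be the per-year total.
totals = [40707, 41927, 43648, 46007]
years = [2022, 2023, 2024, 2025]

# Sum each year's column across the named categories.
named_sums = [sum(column) for column in zip(*verified.values())]

# Whatever is left over must come from the "(Undefined)" rows.
residuals = [total - named for total, named in zip(totals, named_sums)]

for year, named, residual in zip(years, named_sums, residuals):
    print(f"{year}: named categories = {named}, undefined = {residual}")
```

The remainders come out as 2, 4, 7 and 6. That matches reading the first "(Undefined)" row as 2, 4, 6, 6 and the second as a single entry of 1 in 2024.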

Verification pending

2022 (v17) | 2023 (v18) | 2024 (v19) | 2025 (v20)
Verification pending22

Inactive/Archived, obsolete

2022 (v17) | 2023 (v18) | 2024 (v19) | 2025 (v20)
Archived, obsolete2171

Demotic

[Figure: Category development 2022–2025 (Demotic). Values for 2025: Text/Subtext 1729; Object 1597; Others 280.]

Verified

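| Category | 2022 (v17) | 2023 (v18) | 2024 (v19) | 2025 (v20) |
|---|---|---|---|---|
| Text | 1690 | 1692 | 1713 | 1723 |
| Subtext | | 6 | 6 | 6 |
| Object | 1563 | 1565 | 1587 | 1597 |
| Super-text | 32 | 32 | 32 | 32 |
| Collective caption | 247 | 247 | 248 | 248 |
| Total | 3532 | 3542 | 3586 | 3606 |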

Inactive/Archived, obsolete

2022 (v17) | 2023 (v18) | 2024 (v19) | 2025 (v20)
Archived, obsolete11