WikidataCon 2021

Ninai (நினை) and Udiron (উদীরণ): text generation with Wikidata items and lexemes
30/10/2021, Room 2

In the lead-up to Abstract Wikipedia's launch, a sufficient body of linguistic information must be in place so that different sets of functions can work together to produce natural-sounding text. Building that body requires more thorough consideration of certain linguistic aspects sooner rather than later.

This session introduces Ninai and Udiron, two related tools with which functions can be built to generate text based on the linguistic information for a given language. In doing so, it will discuss the compositionality and manipulability of lexical units, the breadth and interconnectedness of meaning units, and the treatment of variation among a language’s lects broadly construed, along with how each of these can be dealt with in those tools.

Special reference will be made to the handling of these aspects for Bengali and a number of other languages.


Link to notes

https://etherpad.wikimedia.org/p/WikidataCon2021-Sisterprojects-Languages

What knowledge will participants gain in this session?
  • Participants will get a brief look at some of the decisions made in setting up lexicographical data for a particular language.
  • Participants will gain a better understanding of where best to focus their attention on-wiki and off-wiki in order to ease the development of text generation systems for their languages.
  • Participants will learn the basics of how Ninai and Udiron work for the languages they currently support.
Language

English

Recording

Yes
