Skip to content

This project aims to create a corpus for solving the problem of binary classification between natural text and "lectura facil" text

Notifications You must be signed in to change notification settings

fernandorpalvarez/corpus-lectura-facil

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

94 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Lectura Fácil Corpus in Spanish

This project is part of the result of a Master's thesis where we seek to create a text corpus, from scratch and in spanish, to resolve the problem of binary clasification between text written in general natural language or "Lectura fácil", a term in spanish that means "Easy read". This project has been made in collaboration with the Polytechnic University of Madrid.

What is "Lectura Fácil"?

According to the official definition: It is a method that includes a set of guidelines and recommendations related to the writing of texts, the design and layout of documents and the validation of their comprehensibility, aimed at making information accessible to people with reading comprehension difficulties.

This blue logo is the European logo of "Easy read". If you see this logo on a website, a document or a book it means that it is easy to read:

If you see this logo in a file it means that it is written in validated in "Easy read" metholodgy

In this website you can find some useful information and resources about "Easy read" methodology in Spanish.

Some context?

"Easy read" methodology is a language discipline that belongs to the text simplification field, that was born in Sweden in the 60s and which today you can find a lot of information and text corpora in "Easy read" methodology in English, but in Spanish it is another story. Since the last 10 years approximately a some papers have been published and research has been made regarding Spanish text simplification, but there has not been as much progress in this field as in English, that is why there is still no standardized Spanish "Easy read" corpora.

About

This project aims to create a corpus for solving the problem of binary classification between natural text and "lectura facil" text

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages