Monday, September 1, 2014

 

 



Soy un nuevo usuario

Olvidé mi contraseña

Entrada usuarios

Lógica Matemáticas Astronomía y Astrofísica Física Química Ciencias de la Vida
Ciencias de la Tierra y Espacio Ciencias Agrarias Ciencias Médicas Ciencias Tecnológicas Antropología Demografía
Ciencias Económicas Geografía Historia Ciencias Jurídicas y Derecho Lingüística Pedagogía
Ciencia Política Psicología Artes y Letras Sociología Ética Filosofía


Spell-checking in Spanish: the case of diacritic accents

1) La descarga del recurso depende de la página de origen
2) Para poder descargar el recurso, es necesario ser usuario
    registrado en Universia

  Descargar recurso

Detalles del recurso

Pertenece a: RECERCAT  

Descripción: This article presents the problem of diacritic restoration (or diacritization) in the context of spell-checking, with the focus on an orthographically rich language such as Spanish. We argue that despite the large volume of work published on the topic of diacritization, currently available spell-checking tools have still not found a proper solution to the problem in those cases where both forms of a word are listed in the checker’s dictionary. This is the case, for instance, when a word form exists with and without diacritics, such as continuo‘continuous’ and continuó ‘he/she/it continued’, or when different diacritics make other word distinctions, as in continúo ‘I continue’. We propose a very simple solution based on a word bigram model derived from correctly typed Spanish texts and evaluate the ability of this model to restore diacritics in artificial as well as real errors. The case of diacritics is only meant to be an example of the possibleapplications for this idea, yet we believe that the same method could be applied to other kinds of orthographic or even grammatical errors. Moreover, given that no explicit linguistic knowledge is required, the proposed model can be used with other languages provided that a large normative corpus is available.

Autor(es): Atserias, Jordi -  Fuentes Fort, Maria -  Nazar, Rogelio -  Renau, Irene - 

Id.: 55205488

Idioma: English  - 

Versión: 1.0

Estado: Final

Palabras claveÀrees temàtiques de la UPC -  Informàtica -  Intel·ligència artificial -  Llenguatge natural - 

Tipo de recurso: Conference lecture  - 

Tipo de Interactividad: Expositivo

Nivel de Interactividad: muy bajo

Audiencia: Estudiante  -  Profesor  -  Autor  - 

Estructura: Atomic

Coste: no

Copyright: sí

: Open Access

Requerimientos técnicos:  Browser: Any - 

Fecha de contribución: 06-may-2012

Contacto:

Localización:


Otros recursos del mismo autor(es)

  1. A quantitative approach to concept analysis The present research focuses on the study of the distribution of lexis in corpus and its aim is to i...
  2. A Flexible Multitask Summarizer for Documents from Different Media, Domain and Language Automatic Summarization is probably crucial with the increase of document generation. Particularly w...
  3. Deliverable 6.1 Infrastructure for Extractive Summarization SKATER Internal Report: software of infrastructure for extractive Summarization (work carried out un...
  4. Deliverable 6.1 Infrastructure for Extractive Summarization SKATER Internal Report: software of infrastructure for extractive Summarization (work carried out un...
  5. The Spanish learner's dictionary DAELE on the panorama of the Spanish e-lexicography This paper presents a prototype of an Internet-based Spanish dictionary for foreign learners, the "D...

Otros recursos de la misma colección

  1. Towards standardized integration of images in the cloud of linked data Currently, there are several ways of describing and referring to images in RDF. This ambiguity resul...
  2. Experiences with mobile processors for energy efficient HPC The performance of High Performance Computing (HPC) systems is already limited by their power consum...
  3. Deconstructing bus access control policies for real-time multicores Multicores may satisfy the growing performance requirements of critical Real-Time systems which has ...
  4. Programmable and scalable reductions on clusters Reductions matter and they are here to stay. Wide adoption of parallel processing hardware in a broa...
  5. Business process mining from e-commerce web logs The dynamic nature of the Web and its increasing importance as an economic platform create the need ...

Valoración de los usuarios

No hay ninguna valoración para este recurso.Sea el primero en valorar este recurso.
 

Busque un recurso