Saturday, April 19, 2014

 

 





Spell-checking in Spanish: the case of diacritic accents


Resource details

Belongs to: RECERCAT

Description: This article presents the problem of diacritic restoration (or diacritization) in the context of spell-checking, with a focus on an orthographically rich language such as Spanish. We argue that, despite the large volume of work published on the topic of diacritization, currently available spell-checking tools have still not found a proper solution to the problem in those cases where both forms of a word are listed in the checker’s dictionary. This is the case, for instance, when a word form exists with and without diacritics, such as continuo ‘continuous’ and continuó ‘he/she/it continued’, or when different diacritics make other word distinctions, as in continúo ‘I continue’. We propose a very simple solution based on a word bigram model derived from correctly typed Spanish texts and evaluate the ability of this model to restore diacritics in artificial as well as real errors. The case of diacritics is only meant to be an example of the possible applications of this idea, and we believe that the same method could be applied to other kinds of orthographic or even grammatical errors. Moreover, given that no explicit linguistic knowledge is required, the proposed model can be used with other languages provided that a large normative corpus is available.
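The description above does not spell out the details of the model, so the following is only a minimal sketch of how a word bigram model could choose among diacritic variants such as continuo, continuó and continúo. The function names (strip_diacritics, train, restore), the scoring rule (sum of the left and right bigram counts, with unigram frequency as a tie-breaker) and the four-sentence toy corpus are all assumptions made for illustration; they are not taken from the paper, which relies on a large corpus of correctly typed Spanish text.

```python
import unicodedata
from collections import Counter, defaultdict


def strip_diacritics(word):
    """Remove acute accents and diaereses; the tilde of 'ñ' is kept."""
    decomposed = unicodedata.normalize("NFD", word)
    kept = "".join(ch for ch in decomposed if ch not in ("\u0301", "\u0308"))
    return unicodedata.normalize("NFC", kept)


def train(sentences):
    """Count unigrams and bigrams, and index words by their stripped form."""
    unigrams, bigrams = Counter(), Counter()
    variants = defaultdict(set)
    for sentence in sentences:
        tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
        for token in tokens:
            variants[strip_diacritics(token)].add(token)
    return unigrams, bigrams, variants


def restore(sentence, model):
    """Pick, for each word, the known variant best supported by its neighbours."""
    unigrams, bigrams, variants = model
    tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
    restored = []
    for i in range(1, len(tokens) - 1):
        word = tokens[i]
        previous = restored[-1] if restored else "<s>"
        following = tokens[i + 1]  # assumption: right neighbour used as typed
        candidates = variants.get(strip_diacritics(word), {word})

        def score(candidate):
            context = bigrams[(previous, candidate)] + bigrams[(candidate, following)]
            # Ties fall back to unigram frequency, then to the typed form.
            return (context, unigrams[candidate], candidate == word, candidate)

        restored.append(max(candidates, key=score))
    return " ".join(restored)


# Toy corpus standing in for a large normative corpus (invented for this sketch).
corpus = [
    "el ruido continuo molesta",
    "ella continuó con el trabajo",
    "yo continúo con el trabajo",
    "un movimiento continuo",
]
model = train(corpus)

print(restore("ella continuo con el trabajo", model))  # ella continuó con el trabajo
print(restore("yo continuo con el trabajo", model))    # yo continúo con el trabajo
print(restore("el ruido continuo molesta", model))     # el ruido continuo molesta
```

All three variants are valid dictionary words, so a word-list lookup cannot flag the error; only the neighbouring words separate them. A sketch like this needs no linguistic knowledge beyond the corpus itself, which is consistent with the claim that the approach can carry over to other languages.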

Author(s): Atserias, Jordi - Fuentes Fort, Maria - Nazar, Rogelio - Renau, Irene

Id.: 55205488

Language: English

Version: 1.0

Status: Final

Keywords: UPC subject areas - Computer science - Artificial intelligence - Natural language

Resource type: Conference lecture

Interactivity type: Expositive

Interactivity level: very low

Audience: Student - Teacher - Author

Structure: Atomic

Cost: no

Copyright: yes

: Open Access

Technical requirements: Browser: Any

Contribution date: 06-May-2012

Contact:

Location:


Other resources by the same author(s)

  1. Deliverable 6.1 Infrastructure for Extractive Summarization SKATER Internal Report: software of infrastructure for extractive Summarization (work carried out un...
  2. The Spanish learner's dictionary DAELE on the panorama of the Spanish e-lexicography This paper presents a prototype of an Internet-based Spanish dictionary for foreign learners, the "D...
  3. El paper de la Lingüística Computacional en la cerca d'informació This article presents an overview of the new information retrieval applications that are ...
  4. UPC-CORE : What can machine translation evaluation metrics and Wikipedia do for estimating semantic textual similarity? In this paper we discuss our participation to the 2013 Semeval Semantic Textual Similarity task. Our c...

Other resources from the same collection

  1. Defining a network management architecture This work proposes an algorithm called k-CriticalNode to solve the controller placement problem in S...
  2. Caracterización y trabajo de campo en el barrio de Mariahilfer-Viena [From] September to the end of December 2012, a stay is being carried out at the Institüt für ...
  3. Environmental management tools for ports Ports are very complex systems from the point of view of environment. In fact, the very existence of...
  4. Using DDM asymmetry metrics for wind direction retrieval from GPS ocean-scattered signals in airborne experiments Reflectometry of signals of opportunity such as those emitted by a global navigation satellite syste...
  5. Optical signal processor for millimeter-wave interferometric radiometry In interferometric radiometry, the correlations between all pairs of radio-frequency (RF) receivers ...
