Friday, October 24, 2014

 

 





Spell-checking in Spanish: the case of diacritic accents


Resource details

Belongs to: RECERCAT

Description: This article presents the problem of diacritic restoration (or diacritization) in the context of spell-checking, with a focus on an orthographically rich language such as Spanish. We argue that, despite the large volume of work published on diacritization, currently available spell-checking tools have still not found a proper solution to the problem in those cases where both forms of a word are listed in the checker’s dictionary. This is the case, for instance, when a word form exists with and without diacritics, such as continuo ‘continuous’ and continuó ‘he/she/it continued’, or when different diacritics make other word distinctions, as in continúo ‘I continue’. We propose a very simple solution based on a word bigram model derived from correctly typed Spanish texts and evaluate the ability of this model to restore diacritics in artificial as well as real errors. The case of diacritics is only meant to be an example of the possible applications of this idea, yet we believe that the same method could be applied to other kinds of orthographic or even grammatical errors. Moreover, given that no explicit linguistic knowledge is required, the proposed model can be used with other languages provided that a large normative corpus is available.
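As a rough illustration of the idea in the description, the following Python sketch restores acute accents by counting word bigrams in correctly typed text. It is not the authors' implementation: the class name `BigramRestorer`, the helper `strip_acute`, the three-sentence toy corpus, and the choice to score each candidate by summing its bigram counts with the left and right neighbours (falling back to unigram frequency on ties) are all assumptions made for this example. Only the acute accent is handled, so ñ and ü are left untouched.

```python
import unicodedata
from collections import Counter, defaultdict


def strip_acute(word):
    """Remove acute accents only (hypothetical helper); ñ and ü survive."""
    decomposed = unicodedata.normalize("NFD", word)
    kept = "".join(ch for ch in decomposed if ch != "\u0301")
    return unicodedata.normalize("NFC", kept)


class BigramRestorer:
    """Toy word-bigram model; an illustrative sketch, not the paper's system."""

    def __init__(self, corpus_sentences):
        self.unigrams = Counter()
        self.bigrams = Counter()
        # Maps an accent-free form to every accented variant seen in the corpus.
        self.variants = defaultdict(set)
        for sentence in corpus_sentences:
            words = sentence.lower().split()
            for w in words:
                self.unigrams[w] += 1
                self.variants[strip_acute(w)].add(w)
            padded = ["<s>"] + words + ["</s>"]
            for a, b in zip(padded, padded[1:]):
                self.bigrams[(a, b)] += 1

    def restore(self, sentence):
        tokens = sentence.lower().split()
        restored = []
        for i, tok in enumerate(tokens):
            # Neighbours are taken as typed (an assumption of this sketch).
            left = tokens[i - 1] if i > 0 else "<s>"
            right = tokens[i + 1] if i + 1 < len(tokens) else "</s>"
            candidates = self.variants.get(strip_acute(tok), {tok})
            best = max(
                sorted(candidates),
                key=lambda c: (
                    self.bigrams[(left, c)] + self.bigrams[(c, right)],
                    self.unigrams[c],
                ),
            )
            restored.append(best)
        return " ".join(restored)


# Tiny made-up corpus standing in for a large normative corpus.
toy_corpus = [
    "el proceso continuo es estable",
    "ella continuó con su trabajo",
    "yo continúo con mi trabajo",
]
model = BigramRestorer(toy_corpus)
print(model.restore("ella continuo con su trabajo"))
print(model.restore("yo continuo con mi trabajo"))
```

With this toy corpus the neighbouring words are enough to pick between continuo, continuó and continúo. A realistic model would need a large corpus and some smoothing for unseen bigrams, neither of which is attempted here.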

Author(s): Atserias, Jordi - Fuentes Fort, Maria - Nazar, Rogelio - Renau, Irene

Id.: 55205488

Language: English

Version: 1.0

Status: Final

Keywords: UPC subject areas - Computer science - Artificial intelligence - Natural language

Resource type: Conference lecture

Interactivity type: Expositive

Interactivity level: very low

Audience: Student - Teacher - Author

Structure: Atomic

Cost: no

Copyright: yes

: Open Access

Technical requirements: Browser: Any

Contribution date: 06-May-2012

Contact:

Location:


Other resources by the same author(s)

  1. Collocations: a challenge in computer assisted language learning The correct use of collocations is one of the most difficult tasks that the student faces when learn...
  2. A quantitative approach to concept analysis The present research focuses on the study of the distribution of lexis in corpus and its aim is to i...
  3. A Flexible Multitask Summarizer for Documents from Different Media, Domain and Language Automatic Summarization is probably crucial with the increase of document generation. Particularly w...
  4. Deliverable 6.1 Infrastructure for Extractive Summarization SKATER Internal Report: software of infrastructure for extractive Summarization (work carried out un...

Other resources from the same collection

  1. The automorphism group of the non-split Cartan modular curve of level 11 We derive equations for the modular curve X-ns(11) associated to a non-split Cartan subgroup of GL(2...
  2. A secure communication system based on a modified chaotic chua oscillator In this paper we propose a new scheme for secure communications us- ing a modified Chua oscillator. ...
  3. Fracture toughness of zirconia from a shallow notch produced by ultra-short pulsed laser ablation Ceria partially stabilized zirconia ceramics (Ce-TZP) with identical grain size and different amount...
  4. Anhydric maleic functionalization and polyethylene glycol grafting of lactide-co-trimethylene carbonate copolymers Lactide and trimethylene carbonate copolymers were successfully grafted with polyethylene glycol via...
  5. Effect of structure, topography and chemistry on fibroblast adhesion and morphology Surface biofunctionalisation of many biodegradable polymers is one of the used strategies to improve...
