Publicidad

Publicidad

becas.universia.netBiblioteca.Net

Buscar recursos:

Buscador Google

rss_1.0 Recursos de colección

Universidade da Coruña. UDCDspace (405 recursos)
UDCDspace é o repositorio dixital da Universidade da Coruña, un sistema que proporciona de xeito estable e seguro a preservación de documentos dixitais produto da actividade científica e institucional da UDC, e facilita a súa accesibilidade en Internet.

Mostrando recursos 1 - 14 de 14

1. Spelling Correction on Technical Documents - Vilares Ferro, Manuel; Otero Pombo, Juan; Graña Gil, Jorge
We describe a novel approach to spelling correction applied on technical documents, a task that requires a number of especific properties such as eficiency, safety and maintenance. In opposite to previous works, we explore the region close to the point at which the recognition halts, gathering all relevant information for the repair process in order to avoid the phenomenom of errors in cascade. Our approach seems to reach the same quality provided by the most performance classic techniques, but with a significant reduction on both time and space costs

2. A Tagger Environment for Galician - Vilares Ferro, Manuel; Graña Gil, Jorge; Araujo, T; Cabrero Souto, David; Diz, I.
In this paper, we introduce a tagger environment for Galician, the native language of Galicia. Galician belongs to the group of Romance languages which developed from the Latin imposed on the north-west of the Iberian Peninsula by the Romans, with additions from the languages of peoples living here before the colonization, as well as contributions from other languages subsequent to the breaking-up of the Roman Empire. Various historical circumstances led to its not becoming a State language and although it was relegated to informal usage, our vernacular has managed to survive well into the twentieth century when, parallel to the recovery...

3. Regional Finite-State Error Repair - Vilares Ferro, Manuel; Otero Pombo, Juan; Graña Gil, Jorge
We describe an algorithm to deal with error repair over finite-state architectures. Such a technique is of interest in spelling correction as well as approximate string matching in a variety of applications related to natural language processing, such as information extraction/recovery or answer searching, where error-tolerant recognition allows misspelled input words to be integrated in the computational process. Our proposal relies on a regional least-cost repair strategy, dynamically gathering all relevant information in the context of the error location. The system guarantees asymptotic equivalence with global repair strategies.

4. Communication Protocols Verification with Esterel - Graña Gil, Jorge; Vilares Ferro, Manuel; Bernhard, R.
This work summarizes design, implementation and verification processes of a digital telephone switchboard in the Esterel real-time programming environment. Our aim is to show the modularity in the description and of flexibility in the verification process. We also show the control synchronization mechanisms to coordinate concurrent processes. The goal is to prevent in compile-time deadlock and lockout phenomena, a feature that is not available in most programming languages.

5. Verificación de Conexiones Telefónicas con Esterel - Graña Gil, Jorge; Vilares Ferro, Manuel; Bernhard, R.
El presente trabajo resume el proceso de diseño, implementación y verificación del comportamiento de una centralita telefónica digital en el entorno de programación de tiempo real síncrono Esterel. Nuestra intención es mostrar la modularidad de la aplicación y la flexibilidad del proceso de verificación. Idéntica atención merecen los mecanismos de control que gestionan la sincronización de procesos concurrentes. El objetivo es detectar los fenómenos de abrazo mortal e interbloqueo en tiempo de compilación, una característica no disponible en todos los lenguajes de programación.

6. Una Aplicación de RI basada en PLN: el Proyecto ERIAL - Barcala Rodríguez, Francisco Mario; Domínguez, Eva María; Alonso Pardo, Miguel Ángel; Cabrero Souto, David; Graña Gil, Jorge; Vilares Ferro, Jesús; Vilares Ferro, Manuel; Rojo, Guillermo.; Santalla, María Paula; Sotelo, Susana

7. Integrating External Dictionaries into Part-of-Speech Taggers - Graña Gil, Jorge; Chappelier, J.-C.; Vilares Ferro, Manuel
The highest performances in part-of-speech tagging have been obtained by using stochastic methods, such as hidden Markov models. The running parameters of a hidden Markov model for tagging can be estimated from tagged corpora. However, the current situation in the automatic processing of some languages is very short training texts, but very large dictionaries. These dictionaries can provide very useful information for improving the treatment of unknown words. In this paper we present new strategies for integrating external dictionaries into a stochastic tagging framework. Instead of the most intuitive Adding One method, we propose the use of the Good-Turing formulas,...

8. Parsing as Resolution - Vilares Ferro, Manuel; Graña Gil, Jorge
A general context-free parsing algoritm based on logical dynamic programming techniques is described. The analyzer takes a general class of context-free grammar as drivers, and any finite string as input. In an empirical comparison, the new system appears to be superior to the others context-free analysers (as for example the SDF system), and comparable to the standard generators of deterministic parsers (as for example YACC, the standard generator of compilers in UNIX) when the input string is not ambiguous.

9. A Spanish e-Dictionary of Synonyms as a Fuzzy Tool for Information Retrieval - Fernández Lanza, Santiago; Graña Gil, Jorge; Sobrino Cerdeiriña, Alejandro

10. Tokenization and Proper Noun Recognition for Information Retrieval - Barcala Rodríguez, Francisco Mario; Vilares Ferro, Jesús; Alonso Pardo, Miguel Ángel; Graña Gil, Jorge; Vilares Ferro, Manuel
In this paper we consider a set of natural language processing techniques that can be used to analyze large amounts of texts, focusing on the advanced tokenizer which accounts for a number of complex linguistic phenomena, as well as for pretagging tasks such as proper noun recognition. We also show the results of several experiments performed in order to study the impact of the strategy chosen for the recognition of proper nouns.

11. GALENA: Tabular DCG Parsing for Natural Languages - Vilares Ferro, Manuel; Alonso Pardo, Miguel Ángel; Graña Gil, Jorge; Cabrero Souto, David
We present a definite clause based parsing environment for natural languages, whose operational model is the dynamic interpretation of logical push-down automata. We attempt to briefly explain our design decisions in terms of a set of properties that practical natural language processing systems should incorporate. The aim is to show both the advantages and the drawbacks of our approach.

12. Une Approche Formelle pour la Génération d'Analyseurs de Langages Naturels - Vilares Ferro, Manuel; Valderruten Vidal, Alberto; Graña Gil, Jorge; Alonso Pardo, Miguel Ángel
Un processus d'analyse syntaxique et d'annotation efficace est déterminante dans l'élaboration de structures d'analyse de langages naturels. Ce papier introduit un environnement de programmation permettant l'implémentation du support formel des langages naturels depuis deux points de vue, analyse syntaxique et annotation. Le problème de l'analyse syntaxique se pose dans le domaine de l'analyse de grammaires algébriques sans restrictions, et celui de l'annotation dans le contexte des automates finis non déterministes. L'analyseur syntaxique prends en entrée un texte arbitraire, suivant la structure désignée par une grammaire algébrique. La structure de la forêt partagée résultante est étudiée par rapport à l'optimisation du partage...

13. Una Aplicación de RI basada en PLN: el Proyecto ERIAL - Barcala Rodríguez, Francisco Mario; Domínguez, Eva María; Alonso Pardo, Miguel Ángel; Cabrero Souto, David; Graña Gil, Jorge; Vilares Ferro, Jesús; Vilares Ferro, Manuel; Rojo, Guillermo.; Santalla, María Paula; Sotelo, Susana

14. Tokenization and Proper Noun Recognition for Information Retrieval - Barcala Rodríguez, Francisco Mario; Vilares Ferro, Jesús; Alonso Pardo, Miguel Ángel; Graña Gil, Jorge; Vilares Ferro, Manuel
In this paper we consider a set of natural language processing techniques that can be used to analyze large amounts of texts, focusing on the advanced tokenizer which accounts for a number of complex linguistic phenomena, as well as for pre-tagging tasks such as proper noun recognition. We also show the results of several experiments performed in order to study the impact of the strategy chosen for the recognition of proper nouns