Sensory Processing of Linguistic Pitch as Reflected by the Mismatch Negativity
- Bharath Ch; Ananthanarayan Krishnan; Jackson T. G
Objective: To assess the extent to which acoustic and phonetic change-detection processes contribute to the mismatch negativity (MMN) to linguistic pitch contours. Design: MMN was elicited from Mandarin and English speakers using a passive oddball paradigm. Two oddball conditions were constructed. In one condition (T1/T2i), the Mandarin high-level tone (T1) was compared with a convex high-rising tone (inverted T2, henceforth referred to as T2i) that occurs as a contextual variant of T1 in running speech. In the other (T2/T2i), the concave high-rising tone (T2) was compared with T2i. Phonetically, T1/T2i represents a within-category contrast for native speakers, whereas T2/T2i represents...
Ambiguity and Contradiction From a Morpho- Syntactic Prototype Perspective
- M. D. López De Luise
Abstract — In this paper, the contradiction and ambiguity problems in Natural language Processing are briefly introduced. We also present the morpho-syntactic WIH (Web Intelligent Handler) prototype and the overall approach it takes to process any Spanish text. Finally, we analyze how it processes Spanish sentences with contradictions or ambiguities using its own perspective, despite deeper linguistic considerations. Index Term — Text-Mining, Automatic summarization, morphosyntactic analysis.
A Multi-Pronged Approach to Improving Semantic Extraction of News Video
- A. G. Hauptmann; M. -y. Chen; M. Christel; W. -h. Lin; J. Yang
Abstract In this paper we describe a multi-strategy approach to improving semantic extraction from news video. Experi-ments show the value of careful parameter tuning, exploiting multiple feature sets and multilingual linguistic resources, applying text retrieval approaches for image features, and establishing synergy between multiple concepts through undirected graphical models. We present a discriminative learning framework called Multi-concept Discriminative Ran-dom Field (MDRF) for building probabilistic models of video semantic concept detectors by incorporating related concepts as well as the low-level observations. The model exploits the power of discriminative graphical models to simultaneously capture the associations of concept with observed data and...
DEVELOPING WRITING SKILLS THROUGH THE USE OF BLOGS
- Socorro Claudia; Tavares De Sousa; Maria Elias
ABSTRACT: This paper aims at describing an empirical research carried out in a private school in Fortaleza, Brazil, in which the production of a blog was used as an opportunity to teach writing skills in Portuguese, based on the curricular parameters for Brazilian basic education, which postulate that the teaching of Portuguese has to be based on texts. The theoretical background for this study was extracted from Schneuwly and Dolz (2004), who developed the idea of the teaching dimensions of genres. In order to accomplish our research purposes, a group of teachers from a private school in Fortaleza was invited...
INTRODUCING A COMPREHENSIVE APPROACH TO ASSESSING PRONUNCIATION TALENT
- Matthias Jilka; Henrike Baumotte; Natalie Lew; Susanne Reiterer
This paper introduces a comprehensive project with the objective of finding the neural correlates of pronunciation talent. It concentrates on the first part of this undertaking, describing the extensive tests necessary to measure phonetic talent in its various dimensions such as production and perception, the segmental and suprasegmental levels of speech or different utterance forms such as spontaneous speech, reading and imitation. The project also investigates psychological and behavioral influences on pronunciation performance, as well as correlations with general linguistic aptitude. Example tasks making use of the distinction between categorical and realizational differences in intonation are used to demonstrate the...
2002. Making it articulated
- Jason Stanley
Abstract: I argue in favor of the view that all the constituents of the propositions hearers would intuitively believe to be expressed by utterances are the result of assigning values to the elements of the sentence uttered, and combining them in accord with its structure. The way I accomplish this is by questioning the existence of some of the processes that theorists have claimed underlie the provision of constituents to the propositions recovered by hearers in linguistic interpretation, processes that apparently bypass assigning these constituents to elements of the logical form of the expression uttered. Here is the view of...
Linguistic Computing, the Association for Computers and the Humanities and
- Digital Humanities; Elisabeth Burr (allc; Richard Cunningham (sdh-semi; Jan-christoph Meister (allc; Elli Mylonas (ach; Martyn Jessop (publicity; John Lavagnino (publicity; Michael Magruder (performance Str
The Society for Digital Humanities – Société pour l’étude des médias interactif
- Violetta Cavalli-sforza; Jaime G. Carbonell; Peter J. Jansen; Violetta Cavalli-sforza; Jaime G. Carbonell; Peter J. Jansen
We describe ongoing efforts towards developing language resources for a transnational digital government project aimed at applying information technology (IT) to a problem of international concern: detecting and monitoring activities related to the transnational movement of illicit drugs. The project seeks to support information sharing, coordination and collaboration among government agencies within a country and across national boundaries by combining a variety of technologies including a distributed query processor with form-based and conversational user interfaces, a language translation system, an event server for event filtering and notification, and an event-trigger-rule server. The prototype system is being developed by U.S. universities...
THE EUROPEAN FRAMEWORK OF REFERENCE AND L2 LEARNERS WITH A LOW LEVEL OF EDUCATION Anne-Mieke Janssen-van Dieten23
(Council of Europe, 2001) is to achieve more coherence and harmony in the field of languages and comparability of language qualifications within the European Community. The core of the document consists of descriptions of communicative activities and matching descriptions of communicative language competence at six levels. Three types of language users are distinguished: Basic user, Independent user and Proficient user. The levels A1 (Breakthrough) and A2 (Waystage) are sublevels of the basic level. Within the levels of the Independent and Proficient user, sublevels B1 (Threshold) and B2 (Vantage), C1 (Effective proficiency and C2 (Mastery) are distinguished. A basic user is...
R.: Plagiarism meets paraphrasing: Insights for the next generation in automatic plagiarism detection
- Marta Vila; Paolo Rosso
Although paraphrasing is the linguistic mechanism underlying many plagiarism cases, little attention has been paid to its analysis in the framework of automatic plagiarism detection. Therefore, state-of-the-art plagiarism detectors find it difficult to detect cases of paraphrase plagiarism. In this article, we analyze the relationship between paraphrasing and plagiarism, paying special attention to which paraphrase phenomena underlie acts of plagiarism and which of them are detected by plagiarism detection systems. With this aim in mind, we created the P4P corpus, a new resource that uses a paraphrase typology to annotate a subset of the PAN-PC-10 corpus for automatic plagiarism detection....
This thesis is globally the result of ten years of work. I started to reflect on the themes developed in it after I took my first degree in Italy in 1985. None of the actual work done before 1990 is contained in the present thesis, though. However, I hope the reader will forgive the still unsatisfactory treatment of some of the issues this thesis deals with. I would also like to say that a form of the chapter named “The Paradox of Phenomenal Observation ” has, in the meantime, been accepted for publication by the Journal of the British Society...
The development of second-order social cognition and its relation with complex language understanding and memory
- Burcu Arslan; Annette Hohenberger; Rineke Verbrugge
In this study, the development of second-order social cognition and its possible relationship with language and memory were investigated. For this reason two second-order false belief tasks (FBT_2), a short term memory task (WST), a complex working memory task (LST), a linguistic perspective-taking test (PTT) and a double-embedded relative clause task (REL_2) were used with 21 Turkish kindergarten children (aged 4-5 years), 47 primary school children (aged 6-12 years) and 10 adults. A general developmental trend was found for all tasks. However, a multiple linear regression showed that once age was partialed out, none of the other tasks could predict...
A Web-based EFL writing environment: integrating information for learners, teachers, and researchers
- David Wiblea; Chin-hwa Kuob; Feng-yi Chiena; Anne Liua; Nai-lung Tsaob
With the rise in the popularity of web-based education, there is a pressing need for the design of web-based systems that are domain-specific. This need is particularly acute for the domain of second language education, where generic web-based systems fall short of fulfilling the potential of the Internet for meeting the particular challenges faced by language learners and teachers. A novel interactive online environment is described which integrates the potential of computers, Internet, and linguistic analysis to address the highly specific needs of second language composition classes. The system accommodates learners, teachers, and researchers. A crucial consequence of the interactive...
1 LANGUAGE TEACHER TRAINING AND BILINGUAL
- Anne-claude Berthoud; Laurent Gajo
1.1. The national linguistic situation Switzerland has a particularly complex linguistic patchwork because, while on the whole it is a
Russian Academy of Sciences,
- Elena Maslova
2. Polyadic roles and participant sets: a summary of cross-linguistic evidence 3. Polyadic and reciprocal across Bantu 3.1 Event-structure marking in Bantu: an overview 3.2 Distribution of the polyadic meanings
A Grid of Regional Language Archives
- Paul Trilsbeek; Daan Broeder; Tobias Van Valkenhoef; Peter Wittenburg
About two years ago, the Max Planck Institute for Psycholinguistics in Nijmegen, The Netherlands, started an initiative to install regional language archives in various places around the world, particularly in places where a large number of endangered languages exist and are being documented. These digital archives make use of the LAT archiving framework  that the MPI has developed over the past nine years. This framework consists of a number of web-based tools for depositing, organizing and utilizing linguistic resources in a digital archive. The regional archives are in principle autonomous archives, but they can decide to share metadata descriptions...
Advanced Transaction Processing in Multilevel Secure File Stores
- Elisa Bertino; Sushil Jajodia; Senior Member; Luigi Mancini; Indrajit Ray
Abstract—The concurrency control requirements for transaction processing in a multilevel secure file system are different from those in conventional transaction processing systems. In particular, there is the need to coordinate transactions at different security levels avoiding both potential timing covert channels and the starvation of transactions at higher security levels. Suppose a transaction at a lower security level attempts to write a data item that is being read by a transaction at a higher security level. On the one hand, a timing covert channel arises if the transaction at the lower security level is either delayed or aborted by the...
T.: The Danish Dependency Treebank: Linguistic Principles and Semi-automatic Tagging Tools. Paper Represented at the Swedish Treebank Symposium
- Matthias T. Kromann
dependency treebank on top of the morphologically tagged Danish PAROLE corpus (291.000 words). This includes work on: (1) a tagging manual with our recommended dependency analyses of Danish and their underlying linguistic motivation; (2) software that can be used to tag a large corpus (either manually, semi-automatically, or auto-matically), given a dependency-based inheritance lexicon for Danish; and (3) software that allows linguists to search for examples of any particular grammatical construction within the tagged corpus. All three are work in progress (we expect to complete the software and tagging manual in November and the entire corpus in March). We will...
Unsupervised Relation Extraction for Automatic Generation of Multiple-Choice Questions
- Naveed Afzal; Viktor Pekar
In this paper, we investigate an unsupervised approach to Relation Extraction to be applied in the context of automatic generation of multiple-choice questions (MCQs). The approach aims to identify the most important semantic relations in a document without assigning explicit labels to them in order to ensure broad coverage, unrestricted to predefined types of relations. The paper examines three different surface pattern types, each implementing different assumptions about linguistic expression of semantic relations between named entities. Our main findings indicate that the approach is capable of achieving high precision rates and its enhancement with linguistic knowledge helps to produce significantly...
Song: Biomedical Ontologies and Text Mining for Biomedicine and Healthcare: A Survey
- Illhoi Yoo; Min Song
In this survey paper, we discuss biomedical ontologies and major text mining techniques applied to biomedicine and healthcare. Biomedical ontologies such as UMLS are currently being adopted in text mining approaches because they provide domain knowledge for text mining approaches. In addition, biomedical ontologies enable us to resolve many linguistic problems when text mining approaches handle biomedical literature. As the first example of text mining, document clustering is surveyed. Because a document set is normally multiple-topic, text mining approaches use document clustering as a preprocessing step to group similar documents. Additionally, document clustering is able to inform the biomedical literature...