Wednesday, July 23, 2014

 

 



Soy un nuevo usuario

Olvidé mi contraseña

Entrada usuarios

Lógica Matemáticas Astronomía y Astrofísica Física Química Ciencias de la Vida
Ciencias de la Tierra y Espacio Ciencias Agrarias Ciencias Médicas Ciencias Tecnológicas Antropología Demografía
Ciencias Económicas Geografía Historia Ciencias Jurídicas y Derecho Lingüística Pedagogía
Ciencia Política Psicología Artes y Letras Sociología Ética Filosofía


Near-Duplicate Detection Using Instance Level Constraints

1) La descarga del recurso depende de la página de origen
2) Para poder descargar el recurso, es necesario ser usuario
    registrado en Universia

  Descargar recurso   Descargar recurso

Detalles del recurso

Pertenece a: ETD at Indian Institute of Science  

Descripción: For the task of near-duplicate document detection, comparison approaches based on bag-of-words used in information retrieval community are not sufficiently accurate. This work presents novel approach when instance-level constraints are given for documents and it is needed to retrieve them, given new query document for near-duplicate detection. The framework incorporates instance-level constraints and clusters documents into groups using novel clustering approach Grouped Latent Dirichlet Allocation (gLDA). Then distance metric is learned for each cluster using large margin nearest neighbor algorithm and finally ranked documents for given new unknown document using learnt distance metrics. The variety of experimental results on various datasets demonstrate that our clustering method (gLDA with side constraints) performs better than other clustering methods and the overall approach outperforms other near-duplicate detection algorithms.

Autor(es): Patel, Vishal - 

Id.: 54390552

Idioma: English (United States)  - 

Versión: 1.0

Estado: Final

Palabras claveDocument Clustering  -  Artificial Intelligence - 

Tipo de recurso: Thesis  - 

Tipo de Interactividad: Expositivo

Nivel de Interactividad: muy bajo

Audiencia: Estudiante  -  Profesor  -  Autor  - 

Estructura: Atomic

Coste: no

Copyright: sí

Requerimientos técnicos:  Browser: Any - 

Relación: [References] G23536

Fecha de contribución: 10-ago-2011

Contacto:

Localización:


Otros recursos del mismo autor(es)

  1. Delayed presentation of a loose body in undisplaced paediatric talar neck fracture Fractures of the talus are rare in children. A high index of suspicion is needed to avoid missing su...
  2. Cycles in spatial and temporal chromosomal organization driven by the circadian clock Dynamic transitions in the epigenome have been associated with regulated patterns of nuclear organiz...
  3. Muscle insulin sensitivity and glucose metabolism are controlled by the intrinsic muscle clock★ Circadian rhythms control metabolism and energy homeostasis, but the role of the skeletal muscle clo...
  4. miR-17∼92 miRNA cluster promotes kidney cyst growth in polycystic kidney disease Polycystic kidney disease (PKD), the most common genetic cause of chronic kidney failure, is charact...
  5. Mapping the Structural Topology of the Yeast 19S Proteasomal Regulatory Particle Using Chemical Cross-linking and Probabilistic Modeling* Structural characterization of proteasome complexes is an essential step toward understanding the ub...

Otros recursos de la misma colección

  1. An Algorithmic Characterization Of Polynomial Functions Over Zpn The problem of polynomial representability of functions is central to many branches of mathematics. ...
  2. Reliability Modelling Of Whole RAID Storage Subsystems Reliability modelling of RAID storage systems with its various components such as RAID controllers, ...
  3. Design Of Truthful Allocation Mechanisms For Carbon Footprint Reduction Global warming is currently a major challenge faced by the world. Reduction of carbon emissions is ...
  4. Boxicity And Cubicity : A Study On Special Classes Of Graphs Let F be a family of sets. A graph G is an intersection graph of sets from the family F if there exi...
  5. Scaling Context-Sensitive Points-To Analysis Pointer analysis is one of the key static analyses during compilation. The efficiency of several com...

Valoración de los usuarios

No hay ninguna valoración para este recurso.Sea el primero en valorar este recurso.
 

Busque un recurso