Information Retrieval: Text Representation and Search Based on Knowledge Graphs and Deep Learning (Explicit and distributed semantics for text representation and retrieval) - 白红宇

Published: 2019-05-25

Language Technologies Institute - Carnegie Mellon University - Chenyan Xiong

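This post is compiled from a talk given by the paper's author. It mainly covers the author's work on information retrieval that uses knowledge graphs and distributed representations to enrich semantic information (an extension of query expansion).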

Introduction

In information retrieval, text is mostly represented with the bag-of-words model. This applies to both the query and the document.

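Bag-of-words model: each word is a discrete dimension in the term vector space. It is one of the foundations of modern search engines.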

Models: BM25, LM (language models), Learning to Rank

Features: TF, IDF, etc.

Problem: vocabulary mismatch

Drawbacks: no semantics, no understanding, heavy reliance on feature engineering; only statistical features are used
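To make the vocabulary mismatch problem concrete, here is a minimal sketch of bag-of-words scoring with one common BM25 variant. It is not taken from the talk: the naive whitespace tokenizer, the example documents, and the parameter defaults `k1=1.2` and `b=0.75` are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score every document against the query with a common BM25 variant."""
    doc_tokens = [tokenize(d) for d in docs]
    n_docs = len(doc_tokens)
    avg_len = sum(len(tokens) for tokens in doc_tokens) / n_docs
    # Document frequency: how many documents contain each term.
    df = Counter(term for tokens in doc_tokens for term in set(tokens))

    scores = []
    for tokens in doc_tokens:
        tf = Counter(tokens)  # bag of words: term -> count, word order is discarded
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue  # no exact term match means no contribution at all
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            length_norm = k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf[term] * (k1 + 1) / (tf[term] + length_norm)
        scores.append(score)
    return scores


# Illustrative documents (made up for this sketch).
docs = [
    "cheap car rental near the airport",
    "affordable automobile hire close to the terminal",
    "history of the roman empire",
]
scores = bm25_scores("cheap car rental", docs)
```

The second document is about the same thing as the query but shares no terms with it, so it receives the same score of zero as the unrelated third document. The model only sees term statistics and has no way to tell the two apart.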

Focus: the ad hoc search task

Two ways to overcome the limitations of bag-of-words:

Knowledge graph: Introducing explicit semantics from the knowledge graph into search

Deep learning: Learn distributed semantics end-to-end
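The two directions can be illustrated with a toy sketch. This is not the author's method, only a rough picture of the idea: the `ENTITY_OF` table stands in for a real entity linker over a knowledge graph, and the three-dimensional vectors in `EMBEDDING` are made up by hand, whereas a real system would learn them end-to-end.

```python
import math

# Hypothetical surface-form -> entity-id table (a stand-in for entity linking).
ENTITY_OF = {
    "car": "E_automobile",
    "automobile": "E_automobile",
    "rental": "E_rental",
    "hire": "E_rental",
    "rome": "E_rome",
}

# Hypothetical hand-made embeddings; real ones would be learned from data.
EMBEDDING = {
    "car": (0.9, 0.1, 0.0),
    "automobile": (0.8, 0.2, 0.0),
    "empire": (0.0, 0.1, 0.9),
}


def entities(text):
    """Map a text to the set of entity ids its words link to."""
    return {ENTITY_OF[w] for w in text.lower().split() if w in ENTITY_OF}


def entity_overlap(query, doc):
    """Explicit semantics: count entities shared by the query and the document."""
    return len(entities(query) & entities(doc))


def cosine(u, v):
    """Distributed semantics: cosine similarity between two dense vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


shared = entity_overlap("car rental", "automobile hire")
close = cosine(EMBEDDING["car"], EMBEDDING["automobile"])
far = cosine(EMBEDDING["car"], EMBEDDING["empire"])
```

In both cases "car" and "automobile" now match even though the strings differ: once through a shared entity id, once through nearby vectors. That is the kind of signal an exact-match bag-of-words model cannot provide.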

Reposted from: http://unoji.baihongyu.com/
