设置参数 `node.name` 可以自定义 Elasticsearch 节点的名字。 此条 Tips 由 medcl 贡献。

ElasticSearch pinyin搜索出来的中文不能高亮

Elasticsearch | 作者 tenyears | 发布于2019年02月22日 | 阅读数:2122

 
kibana pinyin搜索的
GET danganku/archive_files_index/_search
{
"query" : {
"match" : {
"filename.pinyin": {
"query": "dizhen"
}
}},
"highlight": {
"fields": {
"filename.pinyin": {}

}
}
}
结果

"_index": "danganku",
"_type": "archive_files_index",
"_id": "8",
"_score": 0.005596721,
"_source": {
"path": "archiveFiles/temp/11/1238",
"sdate": 2019,
"filename": "中国地震台",
"artist": "央视记者",
"filecontent": "据中国地震台网正式测定:核实中。",
"id": 8,
"state": 0,
"userid": 8
},
"highlight": {
"filename.pinyin": [
"中国地震台"
]
}
},


kibana ik分词搜索

GET danganku/archive_files_index/_search
{
"query" : {
"match" : {
"filename.ik": {
"query": "地震"
}



}},
"highlight": {
"fields": {
"filename.ik": {}

}
}
}
结果
"_source": {
"path": "archiveFiles/temp/11/1235",
"sdate": 2019,
"filename": "中国地震台",
"artist": "央视记者",
"filecontent": "据中国地震台网正式测定:
核实中
",
"id": 5,
"state": 0,
"userid": 5
},
"highlight": {
"filename.ik": [
"中国<em>地震</em>台"
]
}
这是我的设置

	"settings": 
{
"index" : {
"analysis" : {
"analyzer" : {
"pinyin_analyzer" : {
"tokenizer" : "pinyin"
}
},
"tokenizer" : {
"pinyin" : {
"type" : "pinyin",
"keep_separate_first_letter" : false,
"keep_full_pinyin" : true,
"keep_original" : true,
"limit_first_letter_length" : 16,
"lowercase" : true,
"remove_duplicated_term" : true,
"ignore_pinyin_offset":false
}}}}
},
"mappings": {
"archive_files_index": {
"properties": {
"id": {
"type": "integer"
},

"groupid": {
"type": "integer",
"index": false
},

"path": {
"type": "keyword",
"index": false

},
"filename": {
"type": "text",
"fields": {
"ik": {
"analyzer": "ik_max_word",
"search_analyzer": "ik_max_word",
"type": "text"
},
"pinyin": {
"analyzer": "pinyin",
"search_analyzer": "pinyin",
"type": "text"
}

}
}}
求大佬指点

已邀请:

要回复问题请先登录注册