当段落包含带有Elasticsearch索引的句子时进行匹配

问题描述 投票:0回答:1

我使用elasticsearch创建一个程序,该程序可以查找文本中所有引用圣经的地方以及所提到的经文所在的地方我在elasticsearch中索引了圣经的所有经节,每节经文都是一个文档当我通过部分输入经文进行搜索时,我会找到正确的结果(即使是犯错误)如何浏览文本以找到引用经文(甚至部分经文)的所有情况,从而将经文的来源归因于它们?和容忍错误(使用模糊性参数或使用我认为的同义词)

我的索引示例:

{"index":{"_index":"test","_type":"","_id":1}}
{"fields":{"year":3560,"book":"1","chapter":1,"section":1,"text":"others words consectetur adipiscing and others words"},"id":"test1","type":"add"}
{"index":{"_index":"test","_type":"","_id":2}}
{"fields":{"year":3560,"book":"2","chapter":3,"section":2,"text":"others words a sagittis nisl quam and others words"},"id":"test2","type":"add"}
{"index":{"_index":"test","_type":"","_id":3}}
{"fields":{"year":3560,"book":"3","chapter":1,"section":5,"text":"others words Aliquam ultrices auctor pharetra and others words"},"id":"test3","type":"add"}
{"index":{"_index":"test","_type":"","_id":4}}
{"fields":{"year":3560,"book":"4","chapter":2,"section":4,"text":"others words Proin ut vestibulum and others words"},"id":"test4","type":"add"}
{"index":{"_index":"test","_type":"","_id":5}}
{"fields":{"year":3560,"book":"5","chapter":1,"section":5,"text":"others words Aenean pretium tincidunt aliquet and others words"},"id":"test5","type":"add"}
{"index":{"_index":"test","_type":"","_id":6}}
{"fields":{"year":3560,"book":"6","chapter":2,"section":1,"text":"others words In vitae sagittis and others words"},"id":"test6","type":"add"}
{"index":{"_index":"test","_type":"","_id":7}}
{"fields":{"year":3560,"book":"7","chapter":7,"section":7,"text":"others words ligula laoreet pharetra and others words"},"id":"test7","type":"add"}
{"index":{"_index":"test","_type":"","_id":8}}
{"fields":{"year":3560,"book":"8","chapter":1,"section":4,"text":"others words luctus eros a pretium and others words"},"id":"test8","type":"add"}
{"index":{"_index":"test","_type":"","_id":9}}
{"fields":{"year":3560,"book":"9","chapter":1,"section":7,"text":"others words ullamcorper eu id quam and others words"},"id":"test9","type":"add"}
{"index":{"_index":"test","_type":"","_id":10}}
{"fields":{"year":3560,"book":"10","chapter":5,"section":4,"text":"others words Nullam ac enim ac lacus hendrerit and others words"},"id":"test10","type":"add"}

为了恢复其来源,我需要找到该段落中所有在索引中的出现:

Lorem ipsum dolor sit amet,consectetur adipiscing elit。 Nulla rhoncus,nulla vitae porta euismod,purus nisl faucibus nunc,sagittis nisl quam id arcu。塞德·阿梅特·阿塞克(Sed) Proin utvesibulum sem,id rutrum felis。菜豆矢状不育,结果是败类。 Quisque dapibus finibus euismod。 Proin dui urna,auctor ut gravida quis,fringilla quis velit。 Donec sed pulvinar leo。 Sed脉管相视点。毛里斯非dapibus直径。 Pellentesque quis pellentesque libero。Aliquam uluces auctor pharetra。 Cras ullamcorper,dioo坐在amet aliquam convallis,magna nibh gravida nunc,坐在amet volutpat elit purus eget lectus。 Pellentesque eu est risus euismod后果。 Duis id erat porttitor,索达莱斯·贾斯托·诺,阿里凯特(alquet)。 Etiam tincidunt neque ut nisi commodo auctor。 Sed congue urna actellus scelerisque hendrerit。毛uri(Mauris lobortis sed dui utus varius)。Proin ac luctus felis。 矢状弓形]擦除,nec luctus sapien。 Aenean pretium tincidunt aliquet。 Enim vel ligula laoreet pharetra的Morbi。 Sed dignissim luctus eros a pretium。 Vestibulum痣痣nisi,vitae scelerisque nibh bibendum nec。 Donec Laoreet Sapien sed vehicula dictum。 Nullam ac enim ac lacus hendrerit tempor et vitae neque。狮子座贵族的魁北克,augue vitae,congue eros。 Maecenas volutpat ante nec scelerisque前庭。Donec tristique orci erat,未作废的商品。 Nam odio vel quam cursus ullamcorper eu id quam。 Duis volutpat,尼古拉·穆迪斯·马蒂斯,augue ipsum mollis leo,eget efficitur orci augue eget leo。整型facilisis dolor ut vehicula。 Maecenas quis feugiat massa。 Curabitur feugiat dioo eget ligula tincidunt sodales。 Donec feugiat dapibus lectus,非最大履历。菜豆,马鞭草,阿利凯特·梅斯。

我不知道我是否足够清楚,但请随时询问我是否需要更高的精度

我认为此问题由Aho-Corasick算法处理,但我不知道如何将其集成到Elasticsearch中

谢谢!

[我使用elasticsearch创建一个程序,该程序允许查找文本中所有引用圣经的地方以及所提到的经文的位置,我在...中为圣经的所有经文建立了索引]]

algorithm elasticsearch search text full-text-search
1个回答
0
投票
如果我能够正确理解您的问题,那么您正在寻找的只是能够做到的

“某些部分经文:查询
© www.soinside.com 2019 - 2024. All rights reserved.