In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-09-21 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Internet Technology >
Share
Shulou(Shulou.com)06/01 Report--
This article is about how to use Elasticsearch to build a knowledge base retrieval system, the editor thinks it is very practical, so I share it with you to learn. I hope you can get something after reading this article.
1. How are documents of pdf and Office indexed by ES?
More specifically, how pdf, Office class documents (word,ppt,excel, etc.) are imported into ES. As shown in the figure: the question turns to: how to import Office documents and PDF documents into ES for indexing and provide full-text retrieval services?
2. What is the maximum length of fields to be retrieved by Elasticsearch?
After the ES5.X version, keyword supports a maximum length of 32766 UTF-8 characters, and text has no limit on character length. When ignore_above is set, data beyond the given length will not be indexed and cannot be retrieved through term exact matching.
Reference: http://t.cn/RYlEMgn
Refer to 6. 0 official website to interpret: http://t.cn/RYlELoV
Reference luncene7.1API: http://t.cn/RYWvuGl
3. Office&pdf documents are stored in Elastisearch attention list.
Cut the crap and go straight to the picture.
4. Analyze the actual combat code
From a functional and performance point of view, the recommendations for the establishment of a knowledge base are as follows: 1) the core of the knowledge base is to import data into ES, and the core of importing ES is the parsing of various types of documents; 2) set Mapping in advance and define the strategy of field word segmentation and non-word segmentation; 3) for the storage of a field greater than 1MB, it is recommended to use fvh highlight mode, which is set together in Mapping.
The above is how to use Elasticsearch to build a knowledge base retrieval system, the editor believes that there are some knowledge points that we may see or use in our daily work. I hope you can learn more from this article. For more details, please follow the industry information channel.
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
The market share of Chrome browser on the desktop has exceeded 70%, and users are complaining about
The world's first 2nm mobile chip: Samsung Exynos 2600 is ready for mass production.According to a r
A US federal judge has ruled that Google can keep its Chrome browser, but it will be prohibited from
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
About us Contact us Product review car news thenatureplanet
More Form oMedia: AutoTimes. Bestcoffee. SL News. Jarebook. Coffee Hunters. Sundaily. Modezone. NNB. Coffee. Game News. FrontStreet. GGAMEN
© 2024 shulou.com SLNews company. All rights reserved.