elasticsearch - Will StormCrawler reindex records after they are deleted?
Problem description
I am working with StormCrawler 1.12.1 and Elasticsearch 6.5.2 and need to improve the efficiency of my search engine. After indexing documents into Elasticsearch, I deleted some of them for security reasons. Will StormCrawler fetch the deleted URLs again and re-index them? I don't want the deleted records to be re-crawled. How can I achieve this?
Solution
I assume you deleted the documents from the content index. They are probably still in the status index and even if they are not, they might be rediscovered and added back.
The best thing to do would be to add new entries to whichever flavour of URL filters you are using so that these URLs are covered; that way they won't be added back if they are rediscovered. Then delete them from the status index.
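As a rough illustration of those two steps, here is a minimal Python sketch that only prepares the pieces and contacts neither the crawler nor Elasticsearch. It assumes you use a regex-based URL filter whose rules file takes one rule per line with a leading `-` to exclude, and that the status index stores each URL in an exact-match field named `url`. Both are assumptions to check against your own configuration and mapping. The URLs and the helper names `regex_filter_lines` and `status_delete_query` are made up for the example.

```python
import json
import re


def regex_filter_lines(urls):
    """Build one exclusion rule per URL ('-' prefix means reject).

    Assumes a regex filter file with one rule per line; the URL is
    escaped so that it is matched literally.
    """
    return ["-^" + re.escape(u) + "$" for u in urls]


def status_delete_query(urls):
    """Build a delete-by-query body matching the given URLs exactly.

    Assumes the status index has an exact-match (keyword) field
    named 'url'; adjust the field name to your mapping.
    """
    return {"query": {"terms": {"url": list(urls)}}}


# Hypothetical URLs standing in for the documents that were deleted.
deleted_urls = [
    "https://example.com/private/report.html",
    "https://example.com/private/users?id=1",
]

filter_lines = regex_filter_lines(deleted_urls)
delete_body = status_delete_query(deleted_urls)

# Lines to paste into the regex filter file.
print("\n".join(filter_lines))
# Request body for the delete-by-query call on the status index.
print(json.dumps(delete_body, indent=2))
```

The printed rules would go into the regex filter file ahead of any catch-all accept rule, and the JSON body could be sent to Elasticsearch's `_delete_by_query` endpoint on the status index (named `status` here only by assumption). Updating the filters first matters: otherwise a URL removed from the status index can be rediscovered and added back before the new rules take effect.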