1、下载ingest-attachment
2、下载分词器插件ik
3、在kibana Dev Tools执行
(1)、定义文本抽取管道
PUT /_ingest/pipeline/attachment
{
"description": "Extract attachment information",
"processors": [
{
"attachment": {
"field": "content",
"ignore_missing": true
}
},
{
"remove": {
"field": "content"
}
}
]
}
(2)、建立文档结构映射
PUT /docwrite
{
"mappings": {
"properties": {
"id":{
"type": "keyword"
},
"name":{
"type": "text",
"analyzer": "ik_max_word"
},
"type":{
"type": "keyword"
},
"attachment": {
"properties": {
"content":{
"type": "text",
"analyzer": "ik_smart"
}
}
}
}
}
}
4、注意事项java代码需要把非数据性结构文档转化成base64进行处理
备注:请自行安装7.9.1版本的es和kibana
没有合适的资源?快使用搜索试试~ 我知道了~
springboot+es实现对word,pdf,txt等文件的非结构化数据全文内容检索
共220个文件
xml:131个
sample:11个
class:8个
3星 · 超过75%的资源 需积分: 39 94 下载量 168 浏览量
2021-06-11
15:52:18
上传
评论 7
收藏 261KB ZIP 举报
温馨提示
使用spring boot+Elasticsearch 7.9.1+kibana 实现对word,pdf,txt等文件的非结构化数据全文内容检索
资源详情
资源评论
资源推荐
收起资源包目录
springboot+es实现对word,pdf,txt等文件的非结构化数据全文内容检索 (220个子文件)
04c3105ecc746880298492628a8e8b8de1ca5a 58B
121363a2e5db122df6e6379132ce8defc18042 486B
1c1331d04835513cf036b69d3db04e5f4f49e1 73B
2e2b71cc7c102a294ac9d28fed47c9d4227b7d 85B
37ce78d1e19954a015df6ead67950741583694 1KB
3a66fb3601f50bde7c23dc4cf32a9a3a1bc13e 178B
3b084851a85ba91ab7fc45d77fbb486070f07e 434B
3b9d6c64cabd48c9e5c2e1c8ad7f0038fcc9ba 195B
41731d5f1602ebe15e15454908104365d9fefc 1KB
42de8fe871bc03082390d4dae6c6141bcb4465 748B
4d9a655ed577eeb4c2285ff8a55d5bc9cb00e4 986B
4e41b9cd5e4af06d3f207bc659e27918b3e72a 47B
4fae42eca336d328ccb48b2345893225ab0bd7 89B
54d47c9cfe4f4a3cf1d671786568b9675903c3 1KB
6a67c7fb30991431d681b9a2491e4a4d30ae55 55B
6bab9ad19260876ca18a636b8037c02cc1b06a 964B
6e39268616b063372874e4308a87912d7626e8 47B
70d98f3a28580e3be84ad6b36a92a0df3a4c57 4KB
73bb50d56e8c19282593cbf5b081e211923a83 95B
74320534f702c95a4869eb9ce80c6c0fde5f08 66B
7e438b9c186a30c012b176a88093b00c837cfb 458B
8d8c28acb6b7bb74aaf6e4828c439de28728c4 47B
9bd74d766ebd4c033528112148d866555b5c9e 2KB
b6b3a28bf29cf1ae46e1d50b43380628540fff 1KB
b77019e04d5f99a479b053ba472fe1cfa81cd2 199B
bbfb60a07b9ca1e650bf9698baed5fc15b58ff 45B
bc6c993460c5e2d08bf8a7cb4eea9abcbdd804 365B
be6ca472542efb6e8c402ebd730c6e02f935db 79B
c84ea9b4d95453115d0c26488d6a78694e0bc6 40KB
cb56c458214cf1175c0d6ac8b2cb355646b9ae 1KB
cf7efa49f5632442cfa1ee6ec424075242f6bc 408B
ElasticsearchUtil.class 19KB
EsController.class 9KB
Swagger2Config.class 6KB
EsPage.class 4KB
ESConfig.class 3KB
FileObj.class 3KB
FileUploadConfiuration.class 900B
SpringbootElasticsearchApplication.class 809B
mvnw.cmd 5KB
config 317B
d07fc7627a355e1d230a14c1141fb8e635202e 201B
d544fb7fd2036c84b2ac39da12a1ca3f220386 74B
dee0a815476a547dfcfd1aa5dcd8286697471a 4KB
description 73B
dfad641911851c64a176cdcc2b73389bc6bbbb 95B
e1b1d84d8ec2b3cbe0ffac112515a3fa37fc58 965B
e6b24ed922ed21ad251765290e2b185fd7c164 49B
ea9b21342333f6cf063708f74bdca997b7b816 135B
eca336e352c9026decda294ff678968050edfc 198B
exclude 240B
f251c0774593ca4f5335acf0f7483eaa162e8f 3KB
f2b33a301424dd54f54fc6f97685dba5fb5bb2 45B
f952e8292b349ce6623b20a38583dba1caf0a5 324B
.gitignore 292B
HEAD 193B
HEAD 193B
HEAD 32B
HEAD 23B
springboot-elasticsearch.iml 14KB
index 1KB
maven-wrapper.jar 46KB
ElasticsearchUtil.java 18KB
EsController.java 10KB
Swagger2Config.java 4KB
ESConfig.java 2KB
EsPage.java 2KB
FileUploadConfiuration.java 691B
SpringbootElasticsearchApplication.java 492B
FileObj.java 280B
master 193B
master 41B
mvnw 7KB
packed-refs 114B
application.properties 205B
application.properties 205B
maven-wrapper.properties 111B
pre-rebase.sample 5KB
update.sample 4KB
fsmonitor-watchman.sample 3KB
pre-commit.sample 2KB
prepare-commit-msg.sample 1KB
pre-push.sample 1KB
commit-msg.sample 896B
pre-receive.sample 544B
applypatch-msg.sample 478B
pre-applypatch.sample 424B
post-update.sample 189B
es服务配置非数据性结构.txt 1KB
workspace.xml 47KB
pom.xml 4KB
Maven__org_springframework_boot_spring_boot_test_autoconfigure_1_5_1_RELEASE.xml 781B
Maven__org_elasticsearch_client_elasticsearch_rest_high_level_client_7_9_1.xml 767B
Maven__org_springframework_boot_spring_boot_starter_logging_1_5_1_RELEASE.xml 760B
Maven__org_springframework_boot_spring_boot_starter_tomcat_1_5_1_RELEASE.xml 753B
Maven__org_springframework_boot_spring_boot_starter_log4j2_1_5_1_RELEASE.xml 753B
Maven__org_springframework_boot_spring_boot_autoconfigure_1_5_1_RELEASE.xml 746B
Maven__org_springframework_boot_spring_boot_starter_test_1_5_1_RELEASE.xml 739B
Maven__org_springframework_plugin_spring_plugin_metadata_1_2_0_RELEASE.xml 733B
Maven__org_springframework_boot_spring_boot_starter_web_1_5_1_RELEASE.xml 732B
共 220 条
- 1
- 2
- 3
sinat_35208367
- 粉丝: 0
- 资源: 4
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功
评论2