Elasticsearch: Retrieve the document with the attachment content
This documentation is part of the Ingest attachment plugin guide. View the full guide here: How to extract data from PPT, XLS, and PDF files to Elasticsearch.
👋 Welcome to the Stackhero documentation!
Stackhero offers a ready-to-use Elasticsearch cloud solution that provides a host of benefits, including:
- Optimal performance and robust security powered by a private and dedicated VM.
- Customizable domain name secured with HTTPS encryption support.
Save time and simplify your life: it only takes 5 minutes to try Stackhero's Elasticsearch cloud hosting solution!
To view the processed document, retrieve it using its ID:
GET my_index/_doc/my_id
The response should look similar to the following:
{
"_index" : "my_index",
"_type" : "_doc",
"_id" : "my_id",
"_version" : 1,
"found" : true,
"_source" : {
"data" : "e1xydGYxXGFuc2kKVGhpcyBpcyB0aGUgY29udGVudCBvZiBhIFJURiBmaWxlClxwYXIgfQ==",
"attachment" : {
"content_type" : "application/rtf",
"language" : "en",
"content" : "This is the content of a RTF file",
"content_length" : 35
}
}
}
Notice that the _source field now includes both the original Base64 data and the extracted attachment details such as file type and content.