Elasticsearch: Retrieve the document with the attachment content

This documentation is part of the Ingest attachment plugin guide. View the full guide here: How to extract data from PPT, XLS, and PDF files to Elasticsearch.

👋 Welcome to the Stackhero documentation!

Stackhero offers a ready-to-use Elasticsearch cloud solution that provides a host of benefits, including:

  • Optimal performance and robust security powered by a private and dedicated VM.
  • Customizable domain name secured with HTTPS encryption support.

Save time and simplify your life: it only takes 5 minutes to try Stackhero's Elasticsearch cloud hosting solution!

To view the processed document, retrieve it using its ID:

GET my_index/_doc/my_id

The response should look similar to the following:

{
  "_index" : "my_index",
  "_type" : "_doc",
  "_id" : "my_id",
  "_version" : 1,
  "found" : true,
  "_source" : {
    "data" : "e1xydGYxXGFuc2kKVGhpcyBpcyB0aGUgY29udGVudCBvZiBhIFJURiBmaWxlClxwYXIgfQ==",
    "attachment" : {
      "content_type" : "application/rtf",
      "language" : "en",
      "content" : "This is the content of a RTF file",
      "content_length" : 35
    }
  }
}

Notice that the _source field now includes both the original Base64 data and the extracted attachment details such as file type and content.