Class ParquetDataSourcePlugin

java.lang.Object
org.elasticsearch.plugins.Plugin
org.elasticsearch.xpack.esql.datasource.parquet.ParquetDataSourcePlugin
All Implemented Interfaces:
Closeable, AutoCloseable, DataSourcePlugin

public class ParquetDataSourcePlugin extends Plugin implements DataSourcePlugin
Data source plugin that provides Parquet format support for ESQL external data sources.

This plugin provides:

  • Parquet format reader for reading Parquet files from any storage provider

The Parquet format reader uses Apache Parquet's native ParquetFileReader with Iceberg's schema conversion utilities. It supports:

  • Schema discovery from Parquet file metadata
  • Column projection for efficient reads
  • Batch reading with configurable batch sizes
  • Direct conversion to ESQL Page format

Heavy dependencies (Parquet, Hadoop, Iceberg, Arrow) are isolated in this module to avoid jar hell issues in the core ESQL plugin.