The Parquet Writer Module

The Parquet Writer module creates Parquet files from incoming arrays. Because of its efficient compression, Parquet is a common format for storing large batches of data before uploading the resulting files to external systems such as AWS S3, Azure Data Lake and many more. The module detects the schema from the incoming array: the set of columns is determined from all objects in the array, and the data type of each column is taken from the first object. By default the module assumes that every value in a column has the same data type, which is not always the case. To handle such cases, you can set schema overrides and choose to ignore values of the wrong data type. You can read more about this in the module documentation.
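The schema detection described above can be illustrated with a small Python sketch. This is not the module's implementation, only a minimal model of the described behavior; the names infer_schema, coerce_rows, overrides and ignore_wrong_type are hypothetical. It also assumes that a column missing from the first object takes its type from the first object that contains it, and that ignored values are written as nulls.

```python
from typing import Any, Dict, List, Optional


def infer_schema(rows: List[Dict[str, Any]],
                 overrides: Optional[Dict[str, type]] = None) -> Dict[str, type]:
    """Collect columns from all rows; type comes from the first row holding the column."""
    schema: Dict[str, type] = {}
    for row in rows:
        for name, value in row.items():
            # setdefault keeps the type that was seen first for each column
            schema.setdefault(name, type(value))
    # Hypothetical schema overrides replace the detected types
    schema.update(overrides or {})
    return schema


def coerce_rows(rows: List[Dict[str, Any]],
                schema: Dict[str, type],
                ignore_wrong_type: bool = False) -> List[Dict[str, Any]]:
    """Align every row to the schema; wrong-typed values become None or raise."""
    cleaned = []
    for row in rows:
        out = {}
        for name, expected in schema.items():
            value = row.get(name)  # missing columns are filled with None
            if value is not None and not isinstance(value, expected):
                if not ignore_wrong_type:
                    raise TypeError(f"column {name!r}: expected {expected.__name__}")
                value = None  # assumption: ignored values are written as nulls
            out[name] = value
        cleaned.append(out)
    return cleaned


rows = [{"id": 1, "temp": 21.5}, {"id": 2, "temp": "n/a", "unit": "C"}]
schema = infer_schema(rows)
table = coerce_rows(rows, schema, ignore_wrong_type=True)
```

In this sketch the "temp" column is detected as a float from the first object, so the string "n/a" in the second object is either rejected or dropped, depending on the flag. Passing overrides={"temp": str} would instead declare the column as a string up front.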

The filename can either be built from a base filename plus a timestamp, or be taken from the incoming message using template syntax.
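The two naming options can be sketched in Python as follows. The timestamp format, the {dotted.path} placeholder syntax and the function names are assumptions made for illustration; check the module documentation for the actual template syntax.

```python
import re
from datetime import datetime, timezone
from typing import Any, Dict, Optional


def timestamped_name(base: str, now: Optional[datetime] = None) -> str:
    """Option 1: base filename plus a timestamp (the format is an assumption)."""
    now = now or datetime.now(timezone.utc)
    return f"{base}_{now.strftime('%Y%m%dT%H%M%S')}.parquet"


def name_from_message(template: str, message: Dict[str, Any]) -> str:
    """Option 2: resolve {dotted.path} placeholders against the incoming message."""
    def lookup(match: "re.Match[str]") -> str:
        value: Any = message
        for key in match.group(1).split("."):
            value = value[key]  # raises KeyError if the message lacks the path
        return str(value)

    return re.sub(r"\{([\w.]+)\}", lookup, template)


fixed = timestamped_name("readings", datetime(2025, 5, 16, 8, 30, tzinfo=timezone.utc))
dynamic = name_from_message("{data.line}_{data.batch}.parquet",
                            {"data": {"line": "L1", "batch": 42}})
```

The first option produces a new name for every write; the second lets an upstream step in the flow decide the name for each message.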

Example input:

[Figure: Crosser Data Generator example]

Parquet Viewer:

[Figure: Crosser example, Parquet Viewer module]


16 May 2025

Tips & Tricks

About the author

David Nienhaus | Senior Solution Engineer

David is a Senior Solution Engineer at Crosser. He has over 15 years of experience working with software integration and digitization projects for critical infrastructure.
His engineering background gives him the understanding and focus needed to solve customer use cases in the most efficient and successful way.
