Substrait Modifications in Gluten
Substrait is a project aiming to create a well-defined, cross-language specification for data compute operations. Since it is still under active development, there are some lacking representations for Gluten needed computing operations. At the same time, some existing representations need to be modified a bit to satisfy the needs of computing.
In Gluten, the base version of Substrait is v0.23.0. This page records all the Gluten changes to Substrait proto files for reference. It is preferred to upstream these changes to Substrait, but for those cannot be upstreamed, alternatives like AdvancedExtension could be considered.
Modifications to algebra.proto
- Added
JsonReadOptionsandTextReadOptionsinFileOrFiles(#1584). - Changed join type
JOIN_TYPE_SEMItoJOIN_TYPE_LEFT_SEMIandJOIN_TYPE_RIGHT_SEMI(#408). - Added
WindowRel, addedcolumn_nameandwindow_typeinWindowFunction, changedUnboundedinWindowFunctionintoUnbounded_PrecedingandUnbounded_Following, and added WindowType(#485). - Added
output_schemain RelRoot(#1901). - Added
ExpandRel(#1361). - Added
GenerateRel(#574). - Added
PartitionColumninLocalFiles(#2405). - Added
WriteRel(#3690).