dplutils.pipeline.stream.StreamingGraphExecutor.split_batch_submit#
- abstractmethod StreamingGraphExecutor.split_batch_submit(batch: StreamBatch, max_rows: int) Any[source]#
Submit a task to split batch into at most
max_rowsSimilart to task_submit, implementations should arrange by whatever means make sense to take the dataframe reference in
batch.dataofStreamBatch, given its length inbatch.lengthand split into a number of parts that result in no more thanmax_rowsper part. The return value should be a list of objects that can be processed byis_task_ready()andtask_resolve_output().