Scalable data processing for ML workloads. Streaming execution across CPUs and GPUs; supports Parquet, CSV, JSON, and image data. Integrates with Ray Train, PyTorch, and TensorFlow. Scales from a single machine to hundreds of nodes. Use for batch inference, data preprocessing, multi-modal data loading, or distributed ETL pipelines.
/plugin marketplace add zechenzhangAGI/AI-research-SKILLs
/plugin install ray-data@zechenzhangAGI/AI-research-SKILLs
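
As a rough illustration of the preprocessing and batch-inference use case described above, here is a minimal sketch of a Ray Data pipeline. The function names `add_squared_column` and `run_with_ray` are hypothetical and chosen only for this example. It assumes a recent Ray 2.x release installed with `pip install "ray[data]"`, where `ray.data.range` yields a single `id` column and `map_batches` accepts list-valued columns in the returned dict.

```python
from typing import Any, Dict, List


def add_squared_column(batch: Dict[str, Any]) -> Dict[str, Any]:
    """Batch UDF: receives a dict of column name -> values, returns a dict
    with one extra derived column. Stands in for a real preprocessing or
    model-inference step."""
    # Iterating element-wise keeps this working for both NumPy arrays
    # (Ray's default batch format) and plain Python lists.
    squared = [int(x) ** 2 for x in batch["id"]]
    return {**batch, "squared": squared}


def run_with_ray(num_rows: int = 1000) -> List[Dict[str, Any]]:
    """Hypothetical driver, not part of Ray itself.

    Ray is imported lazily so the UDF above can be exercised without it.
    """
    import ray

    # Synthetic dataset with a single "id" column (assumed Ray 2.x behavior).
    ds = ray.data.range(num_rows)
    # Transformations are lazy; batches are processed in a streaming fashion.
    ds = ds.map_batches(add_squared_column, batch_size=256)
    # take() triggers execution and returns the first rows as dicts.
    return ds.take(3)
```

Calling `run_with_ray()` starts a local Ray instance if none is running. For real data, the `ray.data.range` call would typically be replaced by a reader such as `ray.data.read_parquet`, with the UDF left unchanged.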