Home:ALL Converter>Trying to group out data and write them out to files

Trying to group out data and write them out to files

Ask Time:2021-09-22T05:41:13         Author:asuscondo

Json Formatter

I was wondering if anyone knew the proper way to write out a group of files based on the value of a column in Dask. In other words, if I want to group a bunch of columns based on a value in a column and write those out to CSVs. I've been trying to use the groupby-apply paradigm with Dask, but the problem is that it does not return a dask.dataframe object, so the function I apply it with uses the Pandas API.

Is there a better way to approach what I'm trying to do? A scalable solution would be much appreciated because some of the data that I'm dealing with is very large.


Author:asuscondo,eproduced under the CC 4.0 BY-SA copyright license with a link to the original source and this disclaimer.
Link to original article:https://stackoverflow.com/questions/69275854/trying-to-group-out-data-and-write-them-out-to-files