jeudi 23 avril 2015

what is the fastest function in RDD spark

I'm implementing GroupBy function and it is "transformations" operation.

I need that the my GroupBy function must be computed immediately, so I've found out a solution that calling another "action" likes first() or count() operation after GroupBy then it will be computed.

The running time of GroupBy is equal its + the action operation, and thus I need a fastest function to minimum total running time!!


Aucun commentaire:

Enregistrer un commentaire