Assume we have 4 process (rank 0 - rank 3), and rank 0 has the data = [1,2,3,4,5,6]. What I wanna do is
from:
rank 0: [1,2,3,4,5,6], rank 1: [], rank 2: [], rank 3: []
to :
rank 0: [], rank 1: [1,2], rank 2: [3,4], rank 3: [5,6]
basically, rank 0 dispatches data to all others and rank 0 will not be assigned data. I realise there is similar question that can be addressed with MPI_IN_PLACE. I tried but didn't get what I want.