pd_shuffledata / randomize dataframeYou can shuffle the rows of a dataframe by indexing with a shuffled index. For this, you can eg use np.

8/16/2017 - 5:49 PM

pd_shuffledata / randomize dataframe You can shuffle the rows of a dataframe by indexing with a shuffled index. For this, you can eg use np.

pd_shuffledata / randomize dataframe You can shuffle the rows of a dataframe by indexing with a shuffled index. For this, you can eg use np.random.permutation (but np.random.choice is also a possibility):

pd_shuffledata.py

In [12]: df = pd.read_csv(StringIO(s), sep="\s+")

In [13]: df
Out[13]: 
    Col1  Col2  Col3  Type
0      1     2     3     1
1      4     5     6     1
20     7     8     9     2
21    10    11    12     2
45    13    14    15     3
46    16    17    18     3

In [14]: df.iloc[np.random.permutation(len(df))]
Out[14]: 
    Col1  Col2  Col3  Type
46    16    17    18     3
45    13    14    15     3
20     7     8     9     2
0      1     2     3     1
1      4     5     6     1
21    10    11    12     2

If you want to keep the index numbered from 1, 2, .., n as in your example
 you can simply reset the index: 
df_shuffled.reset_index(drop=True)

Cacher is the code snippet organizer for pro developers

We empower you and your team to get more done, faster

pd_shuffledata / randomize dataframe You can shuffle the rows of a dataframe by indexing with a shuffled index. For this, you can eg use np.