Sample Maximum Possible Data Points From Distribution To New Distribution

September 16, 2024

Solution 1:

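For each week, calculate the largest total sample size that the available counts can support, then multiply it by the desired distribution. The idea is:

Divide Count by Desired Distribution to get the largest total each class could support on its own.
Take the minimum of those totals within each week with groupby. The class with the smallest value is the limiting one, so this minimum is the largest total the week can support.
Multiply that weekly total by Desired Distribution and round down to get the number of rows to sample from each class.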
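The three steps can also be written out one at a time, which makes the intermediate values easier to inspect. This is only a sketch: the data is copied from the output table in this answer, the names `possible_total`, `week_total` and `steps` are made up for illustration, and the `round(6)` before flooring is an added guard against floating-point noise that is not part of the original answer.

```python
import numpy as np
import pandas as pd

# Sample data, copied from the output table in this answer.
df = pd.DataFrame({
    "Week": [1, 1, 1, 2, 2, 2],
    "Class": ["A", "B", "C", "A", "B", "C"],
    "Count": [954, 554, 1145, 454, 944, 748],
    "Desired Distribution": [0.55, 0.29, 0.16, 0.55, 0.29, 0.16],
})

# Step 1: the largest total each class could support on its own.
possible_total = df["Count"] / df["Desired Distribution"]

# Step 2: the limiting class sets the total for the whole week.
week_total = possible_total.groupby(df["Week"]).transform("min")

# Step 3: split the weekly total by the desired shares and round down.
# round(6) is an assumption added here so that a value such as
# 953.9999999 is not floored to 953.
raw_counts = (week_total * df["Desired Distribution"]).round(6)
steps = df.assign(
    possible_total=possible_total,
    week_total=week_total,
    new_count=np.floor(raw_counts).astype(int),
)
print(steps)
```

In both weeks class A has the smallest `possible_total`, so it is the limiting class and all of its rows are kept.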

In code:

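# Assumes df already has the columns Week, Count and Desired Distribution.
df['new_count'] = (
    df['Count'].div(df['Desired Distribution'])  # largest total each class could support
    .groupby(df['Week']).transform('min')        # limiting total within each week
    .mul(df['Desired Distribution'])             # per-class sample size at that total
    // 1                                         # round down to whole rows
).astype(int)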

Output:

   Week Class  Count  Distribution  Desired Distribution  new_count
0     1     A    954          0.36                  0.55        954
1     1     B    554          0.21                  0.29        503
2     1     C   1145          0.43                  0.16        277
3     2     A    454          0.21                  0.55        454
4     2     B    944          0.44                  0.29        239
5     2     C    748          0.35                  0.16        132
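As a sanity check, the shares that these counts produce within each week can be compared with the desired distribution. The sketch below uses only the numbers from the output above; the names `result`, `achieved` and `max_gap` are chosen here for illustration.

```python
import pandas as pd

# Values copied from the output of this answer.
result = pd.DataFrame({
    "Week": [1, 1, 1, 2, 2, 2],
    "Class": ["A", "B", "C", "A", "B", "C"],
    "Count": [954, 554, 1145, 454, 944, 748],
    "Desired Distribution": [0.55, 0.29, 0.16, 0.55, 0.29, 0.16],
    "new_count": [954, 503, 277, 454, 239, 132],
})

# Share of each class within its week after sampling.
week_sum = result.groupby("Week")["new_count"].transform("sum")
result["achieved"] = result["new_count"] / week_sum

# Largest gap between the achieved and the desired share.
# It is small but not zero, because the counts are rounded down.
max_gap = (result["achieved"] - result["Desired Distribution"]).abs().max()

# No class is asked for more rows than it actually has.
fits = bool((result["new_count"] <= result["Count"]).all())

print(result[["Week", "Class", "new_count", "achieved"]].round(4))
print(max_gap, fits)
```

The achieved shares stay close to 0.55, 0.29 and 0.16 in both weeks, and the sample totals are 1734 rows for week 1 and 825 rows for week 2.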

