This question is in the context of tuning a training procedure, whereby the learner may receive random stickers for good performance. I am trying to figure out the probability of any given learner maximizing their reward by collecting every possible sticker type. This information will be used to adjust the number of sticker types.
A single learner plays 60 games. Each game has 100 trials. Every time the learner succeeds in a trial, there is a 4% chance that the learner will receive a random sticker. There are an unlimited number of stickers, there are $ n $ sticker types, there is a uniform probability distribution for selecting any given sticker type, and repeat sticker types are possible. (E.g., Bob won two Batman stickers in a row, but he instead wanted a Batman sticker and a Superman sticker.)
Question: Assuming the learner succeeds in every trial, what is the probability that, at the end of the 60th game, the learner will have collected at least one of each of the $ n $ sticker types?
My thinking thus far:
- The problem is equivalent to a single game with 6000 trials.
- The probability of winning any sticker after one trial is $\frac{1}{25}$, or 4%.
- The probability of winning a specific sticker type after one trial is $ \frac{1}{25n} $, or $ \frac{0.04}{n} $.
- $ p_{max} = 1 - p_{fail} $
- The probability of only getting $ i $ unique sticker types after 6000 trials is $p_{i}$.
- $p_{fail} = \sum_{i=0}^{n-1} p_{i}$
- $p_{0} = 0.04^{6000}$ (the probability of not winning any stickers at all after 6000 trials)
At this point I am not sure how to proceed.