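Top-p Sampling, also called nucleus sampling, is a text generation method that samples the next token from the smallest set of most-probable candidates whose cumulative probability reaches or exceeds a specified threshold p. Tokens outside that set are excluded, and the probabilities of the remaining tokens are renormalized before sampling.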
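As a rough illustration of the definition, here is a minimal Python sketch. The function names `top_p_filter` and `sample_top_p` and the example probabilities are hypothetical and chosen only for illustration; production inference libraries typically operate on logit tensors and may differ in detail.

```python
import random

def top_p_filter(probs, p):
    """Keep the smallest set of top-ranked tokens whose cumulative
    probability reaches p, then renormalize. (Hypothetical helper.)"""
    # Rank tokens from most to least probable.
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    nucleus = []
    cumulative = 0.0
    for token, prob in ranked:
        nucleus.append((token, prob))
        cumulative += prob
        if cumulative >= p:
            break  # the nucleus is complete once the threshold is reached
    # Renormalize so the kept probabilities sum to 1.
    total = sum(prob for _, prob in nucleus)
    return {token: prob / total for token, prob in nucleus}

def sample_top_p(probs, p, rng=random):
    """Draw one token from the renormalized nucleus. (Hypothetical helper.)"""
    nucleus = top_p_filter(probs, p)
    tokens = list(nucleus)
    weights = list(nucleus.values())
    return rng.choices(tokens, weights=weights, k=1)[0]

# Made-up next-token distribution for illustration only.
next_token_probs = {"hello": 0.5, "hi": 0.3, "hey": 0.15, "greetings": 0.05}
print(top_p_filter(next_token_probs, 0.75))
print(sample_top_p(next_token_probs, 0.75))
```

With a threshold of 0.75, only "hello" and "hi" survive in this example, because their combined probability is the first to reach the threshold; the two less likely tokens can never be sampled.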
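Voice AI platforms use Top-p Sampling to balance response quality, creativity, and consistency: lower values of p restrict the model to its most likely tokens and produce more predictable replies, while higher values admit less likely tokens and produce more varied ones. It is commonly adjusted alongside Temperature to tune conversational experiences for different business use cases.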
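To show how the two settings can interact, the sketch below applies a temperature to a set of made-up logits and then counts how many tokens fall inside the top-p nucleus. It assumes the common ordering in which temperature is applied before the top-p cutoff; the function names and logit values are hypothetical, and a given platform may combine the settings differently.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Convert logits to probabilities after dividing by temperature.
    (Hypothetical helper; temperature must be greater than 0.)"""
    scaled = [value / temperature for value in logits.values()]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return {token: e / total for token, e in zip(logits, exps)}

def nucleus_size(logits, temperature, top_p):
    """Count the tokens in the top-p nucleus at a given temperature.
    (Hypothetical helper.)"""
    probs = softmax_with_temperature(logits, temperature)
    ranked = sorted(probs.values(), reverse=True)
    cumulative = 0.0
    for count, prob in enumerate(ranked, start=1):
        cumulative += prob
        if cumulative >= top_p:
            return count
    return len(ranked)

# Made-up logits for four candidate replies, for illustration only.
example_logits = {"yes": 2.0, "sure": 1.5, "okay": 1.0, "maybe": 0.0}
for temperature in (0.5, 2.0):
    size = nucleus_size(example_logits, temperature, top_p=0.85)
    print(f"temperature={temperature}: {size} tokens in nucleus")
```

In this example the lower temperature sharpens the distribution, so fewer tokens are needed to reach the same threshold. This is one reason the two settings are usually tuned together rather than in isolation.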