If unspecified, the default behavior is TURN_INCLUDES_ONLY_ACTIVITY.
The users turn includes all realtime input since the last turn, including inactivity (e.g. silence on the audio stream).
Includes audio activity and all video since the last turn. With automatic activity detection, audio activity means speech and excludes silence.
The users turn only includes activity since the last turn, excluding inactivity (e.g. silence on the audio stream). This is the default behavior.
Options about which input is included in the user's turn.