Or did you mean for traffic scene understanding?
Datasets like RHD (Rendered Hand Pose Dataset) and OneHand10K are used for training AI models in this field.
Research in HaDR discusses using Stable Diffusion with ControlNet to generate realistic hand images from pose data.