Yes, it is possible to cue animations with sound in Spine. You can use audio event keys to play audio during the animation. For more information, you can refer to the Events page of the Spine User Guide. However, Spine does not natively support generating animations from motion capture data. While there have been discussions about importing motion capture data, it is considered a high-effort feature and not currently supported due to the complexity and potential issues with mapping 3D data to 2D animations.
For lip-syncing, tools like Rhubarb Lip Sync can be used to generate lip-sync animations from audio files, and there are various threads discussing its integration with Spine, such as this one.
For motion capture, there have been some user projects and discussions, but it is generally seen as a challenging feature to implement effectively in Spine. You can find more information in threads like this one.