Three stages of the motion generation process at which emotion embeddings can be incorporated: (a) R-transformer input: the emotion embedding influences the refinement of motion tokens. (b) M-transformer input: the emotion embedding guides the generation of base motion tokens. (c) VQ-VAE input: the emotion embedding conditions the encoding and decoding of motion sequences.
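
As a rough illustration of the three options, the sketch below routes a single emotion embedding to exactly one stage and leaves the other two unconditioned. Everything here is hypothetical: the names `InjectionPoint`, `route_emotion`, and `condition` are not from the original work, and concatenating the embedding onto each feature vector is only one simple way to condition a stage, assumed here for clarity rather than taken from the source.

```python
from enum import Enum
from typing import Dict, List, Optional

Vector = List[float]


class InjectionPoint(Enum):
    """The three candidate stages (hypothetical names for illustration)."""
    R_TRANSFORMER = "r_transformer"  # (a) refinement of motion tokens
    M_TRANSFORMER = "m_transformer"  # (b) generation of base motion tokens
    VQ_VAE = "vq_vae"                # (c) encoding/decoding of motion sequences


def route_emotion(point: InjectionPoint,
                  emotion: Vector) -> Dict[InjectionPoint, Optional[Vector]]:
    """Give the emotion embedding to the chosen stage only; others get None."""
    return {p: (emotion if p is point else None) for p in InjectionPoint}


def condition(features: List[Vector], emotion: Optional[Vector]) -> List[Vector]:
    """Append the emotion embedding to every feature vector.

    Assumption: conditioning by concatenation. If no embedding is routed to
    this stage, the features pass through unchanged.
    """
    if emotion is None:
        return [list(f) for f in features]
    return [list(f) + list(emotion) for f in features]


if __name__ == "__main__":
    emotion = [0.1, 0.9]  # toy 2-D emotion embedding
    routing = route_emotion(InjectionPoint.M_TRANSFORMER, emotion)
    base_token_features = [[1.0, 2.0], [3.0, 4.0]]  # toy stage input
    for stage, emb in routing.items():
        print(stage.value, condition(base_token_features, emb))
```

Routing the embedding to one stage at a time keeps the three variants directly comparable, since each differs from the others only in where the conditioning signal enters.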