Edge Dance

Create dance routines from music.

What is Edge Dance?

EDGE (Editable Dance Generation from Music) is an AI tool that generates high-quality choreographies from music. It encodes the input music into embeddings with a frozen Jukebox model, then uses a conditional diffusion model to map those embeddings to a series of 5-second dance clips. At inference time, temporal constraints are applied to batches of multiple clips to enforce temporal consistency before the clips are stitched into an arbitrary-length full video. The tool supports arbitrary spatial and temporal constraints, which makes it suitable for end-user applications such as dances subject to joint-wise constraints, motion in-betweening, and dance continuation. EDGE also introduces a Contact Consistency Loss that improves physical realism: it keeps intentional foot sliding intact while avoiding unintentional foot sliding, so the generated dances stay physically plausible. Human raters strongly preferred dances generated by EDGE over those from previous work. Potential applications span entertainment and the arts.

Pros

  • Generates high-quality choreographies
  • Uses music embeddings
  • Frozen Jukebox model encoding
  • Uses conditional diffusion model
  • Enforces temporal consistency
  • Handles multiple 5-second clips
  • Supports arbitrary-length full videos
  • Arbitrary spatial and temporal constraints
  • Suitable for joint-wise constraints
  • Useful for motion in-betweening
  • Supports dance continuation
  • New Contact Consistency Loss
  • Improves physical realism
  • Avoids unintentional foot sliding
  • Physically plausible dance generation
  • Outperforms previous work
  • Highly rated by human raters
  • Applications in entertainment industry
  • Applications in arts industry
  • Creates routines from unseen music
  • Generates lower body from upper body
  • Generates upper body from lower body
  • Start and end with prespecified motions
  • Trainable with specific motion
  • Embeddings from Jukebox model
  • Generates any length dances

Cons

  • Requires a pretrained Jukebox model
  • Limited to 5-second clips
  • Temporal consistency enforcement needed
  • Heavy reliance on constraints
  • Potential physical realism issues
  • Loss of motion continuity
  • Difficulty handling complex sequences
  • Demanding computational resources
  • Possible foot sliding inaccuracies
  • Limited end-user applications

Edge Dance FAQ

What is EDGE: Editable Dance Generation from Music?

EDGE: Editable Dance Generation from Music is an AI tool that generates high-quality choreographies from music. It uses music embeddings from the Jukebox model and a conditional diffusion model to map these music embeddings to a series of 5-second dance clips.

How does EDGE generate choreographies from music?

EDGE generates choreographies from music by encoding the input music into embeddings using a frozen Jukebox model. Then, a conditional diffusion model is used to map these music embeddings to a series of 5-second dance clips. Temporal constraints are applied to batches of multiple clips for temporal consistency before they are stitched into an arbitrary-length full video.
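The clip layout described above can be sketched in a few lines. This is a minimal illustration, not EDGE's code: the function name `clip_windows` is hypothetical, and the 2.5-second overlap between neighbouring clips is an assumption made for the example (the text here only states that clips are 5 seconds long).

```python
def clip_windows(total_seconds, clip_seconds=5.0, overlap_seconds=2.5):
    """Return (start, end) times of fixed-length clips covering a track.

    The overlap value is an assumption for illustration. The last
    window may run past the end of the track and would need padding.
    """
    if not 0 <= overlap_seconds < clip_seconds:
        raise ValueError("overlap must be non-negative and shorter than a clip")
    stride = clip_seconds - overlap_seconds
    windows = []
    start = 0.0
    while True:
        windows.append((start, start + clip_seconds))
        if start + clip_seconds >= total_seconds:
            break
        start += stride
    return windows


# A 12-second track is covered by four overlapping 5-second windows.
windows = clip_windows(12.0)
```

Each window would then be encoded and passed to the diffusion model as one clip in a batch.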

What is the Jukebox model used for in EDGE?

EDGE uses a frozen Jukebox model to encode the input music into embeddings. Because Jukebox provides a broad understanding of music, EDGE can create high-quality dances even for in-the-wild music samples.

How does the conditional diffusion model in EDGE work?

The conditional diffusion model in EDGE learns to map music embeddings to a series of 5-second dance clips. It is conditioned on the embeddings that the frozen Jukebox model produces from the input music.

What are the temporal constraints applied by EDGE?

During inference, EDGE applies temporal constraints to batches of multiple clips. The constraints enforce temporal continuity between neighbouring clips, so the clips can be stitched into a full video of arbitrary length.
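As a rough sketch of the bookkeeping involved, the example below forces each clip to begin with the frames that end the previous clip, then concatenates the clips while keeping each shared region once. It is a simplification: it operates on already generated clips and does not show how the diffusion sampler would apply the constraint. The function names and the integer "frames" are hypothetical stand-ins for real pose data.

```python
def enforce_overlap(clips, overlap):
    """Copy the last `overlap` frames of each clip onto the start of the next."""
    if overlap <= 0 or any(overlap >= len(c) for c in clips):
        raise ValueError("overlap must be positive and shorter than every clip")
    fixed = [list(clips[0])]
    for clip in clips[1:]:
        prev_tail = fixed[-1][-overlap:]
        fixed.append(prev_tail + list(clip[overlap:]))
    return fixed


def stitch(clips, overlap):
    """Concatenate clips, keeping each overlapping region only once."""
    frames = list(clips[0])
    for clip in clips[1:]:
        frames.extend(clip[overlap:])
    return frames


# Three 4-frame clips with a 2-frame overlap; the 9s and 8s are
# inconsistent frames that the constraint overwrites.
raw = [[0, 1, 2, 3], [9, 9, 4, 5], [8, 8, 6, 7]]
consistent = enforce_overlap(raw, overlap=2)
dance = stitch(consistent, overlap=2)
```

After the constraint is applied, every clip starts exactly where the previous one ends, so the stitched sequence has no jump at the seams.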

What kind of constraints can EDGE support?

EDGE supports arbitrary spatial and temporal constraints. These enable applications such as arbitrarily long dances (by enforcing temporal continuity between batches of multiple sequences), dances subject to joint-wise constraints (for example, generating the lower body from upper-body motion, or vice versa), motion in-betweening, and dances that start with a prespecified motion.
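One common way to impose such constraints on a generative model is inpainting-style masking: wherever a mask marks a value as known, the known value replaces the generated one. The sketch below assumes that mechanism for illustration only; the text here does not specify how EDGE implements its constraints, and `apply_constraint` and the toy four-joint layout are hypothetical.

```python
def apply_constraint(generated, known, mask):
    """Keep `known` values where the mask is True, `generated` values elsewhere.

    All three arguments are lists of frames; each frame holds one value
    per joint (a stand-in for a full joint rotation or position).
    """
    return [
        [k if m else g for g, k, m in zip(g_frame, k_frame, m_frame)]
        for g_frame, k_frame, m_frame in zip(generated, known, mask)
    ]


# Two frames, four joints: joints 0-1 play the upper body (given),
# joints 2-3 play the lower body (left to the model).
generated = [[0.1, 0.2, 0.3, 0.4], [0.5, 0.6, 0.7, 0.8]]
known = [[1.0, 1.0, 0.0, 0.0], [2.0, 2.0, 0.0, 0.0]]
mask = [[True, True, False, False], [True, True, False, False]]

constrained = apply_constraint(generated, known, mask)
```

A mask over joints gives a spatial (joint-wise) constraint; a mask over whole frames, such as the first or last frames of a clip, gives a temporal one suited to in-betweening or continuation.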

What is the Contact Consistency Loss in EDGE?

The Contact Consistency Loss is a training loss introduced with EDGE that learns when feet should and shouldn't slide. It significantly improves physical realism by avoiding unintentional foot sliding while keeping intentional foot-ground sliding intact.
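One plausible form of such a loss, sketched under assumptions, penalizes foot movement only in frames where the foot is marked as being in contact with the ground. The function name, the input layout, and the exact formula below are illustrative choices, not taken from this page; in a real model the foot positions would come from forward kinematics and the contact labels would be predicted by the model.

```python
def contact_consistency_loss(foot_positions, contact):
    """Mean squared foot displacement, counted only while a foot is in contact.

    foot_positions: list of frames, each a list of (x, y, z) per foot.
    contact: list of frames, each a list of 0/1 contact labels per foot.
    """
    n = len(foot_positions)
    if n < 2:
        raise ValueError("need at least two frames")
    total = 0.0
    for t in range(n - 1):
        feet = zip(foot_positions[t], foot_positions[t + 1], contact[t])
        for now, nxt, in_contact in feet:
            sq_step = sum((b - a) ** 2 for a, b in zip(now, nxt))
            total += in_contact * sq_step
    return total / (n - 1)


# One foot moving along x over three frames.
positions = [[(0.0, 0.0, 0.0)], [(1.0, 0.0, 0.0)], [(3.0, 0.0, 0.0)]]

# Labelled as planted in the first frame: the first step is penalized.
planted = contact_consistency_loss(positions, [[1], [0], [0]])

# Labelled as never in contact (an intentional slide): no penalty.
sliding = contact_consistency_loss(positions, [[0], [0], [0]])
```

Because the penalty is gated by the contact label, movement that the model marks as an intentional slide costs nothing, while movement of a foot marked as planted is discouraged.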

How does EDGE ensure the physical realism of the generated dances?

EDGE ensures physical realism of the generated dances with its new Contact Consistency Loss that learns when feet should and shouldn't slide. This significantly improves physical realism while keeping intentional foot-ground contact sliding intact. The tool has been trained with physical realism in mind.