Browser-based timelapse generator with pose-based photo alignment
Pose-based alignment in the browser fixes shaky timelapses without Photoshop.
Auto-align double-ender (or triple-ender) podcast recordings. Drop in a master track and the individual local tracks; PodSync finds each track's offset, pads or trims it, and outputs aligned files ready for your DAW.
MFCC cross-correlation beats manual alignment for double-ender podcasts.
Podcast editors and audio producers
PluralEyes · Desync · DaVinci Resolve
I've wanted to automate this since 2019, after first hearing about it on the popular Accidental Tech Podcast. I figured I'd write it in Kotlin first (my language of choice), but JVM audio processing wasn't there (or, more fairly, I would have needed to put in way more work than I realized).
With AI, of course, I took another shot at it recently and finally built it in Rust.
"PodSync" takes a master track and individual participant tracks, finds the time offset for each using VAD (voice activity detection), MFCC fingerprinting, and cross-correlation, then outputs aligned WAV files. Drop them into your DAW at 0:00 and they line up!
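To make the offset-finding idea concrete, here is a minimal Python sketch. It is not PodSync's Rust code. It swaps VAD and MFCC fingerprints for a much cruder feature, per-frame RMS energy, and scores each candidate lag with a normalized (Pearson) cross-correlation. The function names (`frame_energy`, `estimate_lag`, `align`) and the `max_lag` and `min_overlap` parameters are hypothetical and chosen only for illustration.

```python
import math


def frame_energy(samples, frame_len):
    """Per-frame RMS energy; a crude stand-in for VAD + MFCC features."""
    n_frames = len(samples) // frame_len
    feats = []
    for k in range(n_frames):
        frame = samples[k * frame_len:(k + 1) * frame_len]
        feats.append(math.sqrt(sum(s * s for s in frame) / frame_len))
    return feats


def pearson(a, b):
    """Pearson correlation of two equal-length lists, or None if either is constant."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / math.sqrt(var_a * var_b)


def estimate_lag(master_feat, track_feat, max_lag, min_overlap):
    """Lag in frames where track_feat[i] best matches master_feat[i + lag].

    A positive lag means the track started recording after the master.
    Returns None if no candidate lag has enough usable overlap.
    """
    best_lag, best_score = None, -2.0
    for lag in range(-max_lag, max_lag + 1):
        # Indices of track_feat that have a counterpart in master_feat at this lag.
        start = max(0, -lag)
        stop = min(len(track_feat), len(master_feat) - lag)
        if stop - start < min_overlap:
            continue
        score = pearson(master_feat[start + lag:stop + lag], track_feat[start:stop])
        if score is not None and score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def align(track, offset_samples, length):
    """Pad (positive offset) or trim (negative offset) a track to the master's timeline."""
    if offset_samples >= 0:
        out = [0.0] * offset_samples + list(track)
    else:
        out = list(track[-offset_samples:])
    out = out[:length]
    return out + [0.0] * (length - len(out))
```

Multiplying the estimated lag by the frame length gives the offset in samples, which `align` then applies so the track can sit at 0:00 next to the master. Frame-level features only resolve the offset to the nearest frame, so a real tool would need finer features or a refinement pass.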
There's an accompanying blog post with a visual on the mechanics: https://kau.sh/blog/podsync/
Would love to hear feedback!
Curved connectors show word order changes better than linear interlinear text.
Poetic manifesto with no executable code, falsifiable claims, or peer review.
Mel-scale FFT with spring-damper physics makes terminal visualizer actually match human hearing.
Solves a 10-year-old stale W3C CSS proposal that no other tool addresses.
A Substack essay asking for feedback, not a launchable product. Ship first, solicit later.