` and a `<meta name="author">` in your composition's HTML head. We treat them as a contract: the metadata in the MP4 should match the metadata in the source.

The interesting bit for 2026: AI agents and search crawlers increasingly read MP4 metadata directly. A video downloaded from YouTube and re-uploaded to Twitter and then re-downloaded by an agent still carries the `udta` metadata you set when you encoded it. Page-level JSON-LD does not survive this trip. Atom metadata does.

## Chapter cues inside the MP4

A less-known trick: MP4 supports an embedded text track that the player reads as chapters. This is different from a sidecar WebVTT file: the chapter info lives *inside* the MP4 itself, so it survives any kind of redistribution.

The format is a small text-stream track marked with `kind="chapters"`. ffmpeg can produce this from a chapters file:

```
;FFMETADATA1
title=From DOM to MP4

[CHAPTER]
TIMEBASE=1/1000
START=0
END=47000
title=Composition resolution

[CHAPTER]
TIMEBASE=1/1000
START=47000
END=124000
title=Chromium boot
```

Save as `chapters.txt`, then:

```bash
ffmpeg -i input.mp4 -i chapters.txt -map_metadata 1 -codec copy output.mp4
```

Apple's QuickTime, VLC, mpv, and most modern browsers will read this. YouTube will read it on upload and auto-generate the chapter sidebar. This is the single most underused video distribution trick I know.

## The schema.org evolution

A few things changed in schema.org's video vocabulary recently that are worth knowing.

The `transcript` property on `VideoObject` was previously a free-text string; in late 2025 schema.org accepted a proposal to allow it to point to a `Transcript` typed object with structured timestamps. This is what AI assistants read now to answer "what does this video say at minute 3?"

The `learningResourceType` property is increasingly used by educational AI assistants to filter content. If your video is a tutorial, mark it as `"Tutorial"`. If it is a demonstration, `"Demonstration"`.
The vocabulary is constrained; check schema.org for the current list.

The `accessibilityFeature` array is read by accessibility-focused search tools. Values like `"captions"`, `"audioDescription"`, `"transcript"` tell crawlers what is available. Set them.

## The YouTube data layer

A specific note for video that ends up on YouTube. YouTube reads the following on upload:

- The video file's `udta` metadata (title, description, author).
- Embedded chapter tracks (auto-populates the chapter sidebar).
- WebVTT caption files uploaded alongside.
- Any embedded subtitles inside the MP4.

If you upload an MP4 to YouTube with all of these set, YouTube auto-populates 80% of the upload form. This is a small workflow win for individual creators and a large one for teams uploading hundreds of videos per week.

The HyperFrames CLI ships a `--youtube-ready` flag that ensures the right atoms are written and that a `.vtt` sidecar is generated from any `<track>` elements in the source composition. It is the same data; the flag just ensures it ends up in the places YouTube expects to find it.

## What we ship by default

For the curious: every MP4 the HyperFrames CLI produces in 2026 includes, by default:

- `udta` metadata from the composition's `<title>`, `<meta name="author">`, and `<meta name="description">`.
- Chapter cues from any `<section>` elements with `data-chapter` attributes.
- A `meta` atom with a JSON blob of the composition's content hash (used for snapshot diffing).

The CLI does not, by default, ship JSON-LD or WebVTT files; those are properties of the page that embeds the video, not of the video file itself. We provide a helper (`hyperframes metadata extract`) that emits a JSON-LD `VideoObject` from the MP4's metadata, which you can paste into your page or generate dynamically server-side.

If you want to integrate this into a Next.js or similar site, the [Next.js integration](/integrations/nextjs) covers how we wire metadata extraction into static page generation.
## A short checklist

For anyone who wants a TL;DR:

- Ship JSON-LD `VideoObject` on every page that embeds video.
- Set the `udta` atoms on the MP4 itself (title, author, date, description).
- Add a WebVTT chapters track for any video longer than 60 seconds.
- Add WebVTT captions for any video with speech.
- Add a WebVTT descriptions track for any video where the visual carries information.
- Embed chapter cues *inside* the MP4 for redistribution durability.
- Update `accessibilityFeature` in your schema.org markup to reflect what you shipped.

Each of these is a small win. Together they are the difference between a video that is *findable* and one that is invisible.

That said: shipping metadata is the second-most-important thing you can do for video. The first is shipping the video. Get the video out the door, then come back and do the metadata. The order matters in that direction.

We covered the shipping side in [from DOM to MP4](/blog/from-dom-to-mp4) and the agent-side distribution patterns in [why AI agents need deterministic rendering](/blog/ai-agents-need-deterministic-rendering).

Now, go set some atoms.

---

# Scroll-driven video: turning timelines into scroll positions

URL: https://hyperframes.video/blog/scroll-driven-video-timelines
Published: 2026-05-16T14:00:00.000Z
Tags: tutorial, scroll, css, animation
Author: ren-park

Scroll-driven animations have been "coming soon" for so long that I had stopped tracking them. Then a few weeks ago I went to add a fallback for an animation on our marketing site, and I noticed the fallback was no longer needed. The `animation-timeline` property and the `scroll()` / `view()` timeline functions are now Baseline 2026: supported in every modern browser, no flags, no polyfills. That is a quiet milestone, but for anyone doing motion on the web it changes a lot.
This post is a practical tutorial on what scroll-driven animations are, when to use them instead of video, when to use them *with* video, and the specific pattern we settled on for hyperframes.dev.

## What "scroll-driven" actually means

Until 2024 or so, "scroll animations" on the web meant one of two things: (1) a JS library reading `window.scrollY` and updating CSS properties on each frame, or (2) the IntersectionObserver API toggling classes when elements entered or left the viewport. Both worked. Neither was great. The JS approach was expensive and janky; the observer approach was binary, not continuous.

Scroll-driven animations are different. The browser exposes the scroll position itself as a *timeline*, and you bind a normal CSS animation to that timeline instead of to wall-clock time. The animation progresses as you scroll, in lockstep. No JavaScript. No jank.

The two flavors:

- `animation-timeline: scroll()`: the animation progress is tied to the scroll position of an ancestor scroll container. You scroll, it animates.
- `animation-timeline: view()`: the animation progress is tied to where a specific element is within the viewport. As the element enters and crosses the viewport, the animation runs.

The `view()` timeline is the more useful of the two for most editorial work. It lets you say "when this section is on screen, run this animation as a function of how far through the viewport it has scrolled."

## A minimal example

Here is a complete, working scroll-driven animation. Drop this into an HTML file and open it.

```html
<style>
  @keyframes reveal {
    from { opacity: 0; transform: translateY(40px); }
    to { opacity: 1; transform: translateY(0); }
  }
  .card {
    animation: reveal linear;
    animation-timeline: view();
    animation-range: entry 10% cover 30%;
  }
</style>

<div style="height: 100vh">scroll down</div>
<div class="card">I appear as I scroll into view.</div>
<div style="height: 100vh">scroll up</div>
```

A few things are doing work here.
The `animation-timeline: view()` says "tie this animation's progress to where the element is in the viewport." The default range is `entry 0% exit 100%`: the animation starts when the element first appears at the bottom of the viewport, ends when it has just left at the top.

The `animation-range: entry 10% cover 30%` is the interesting knob. It says: start the animation when the element is 10% past the start of its entry, and finish it when it has covered 30% of the viewport. This is how you control the *feel* of a scroll-driven animation. Tighten the range for snappier reveals; loosen it for slower, more cinematic ones.

`animation: reveal linear` uses linear timing because the *scroll* is the timing function. The user's scroll velocity is the easing. (You can layer additional easing if you want, but it composes confusingly with scroll velocity. I usually let scroll do the work.)

## Scroll vs video: a decision tree

The question I get most: when do I use scroll-driven animations vs a video element? Here is the way I think about it.

**Use scroll-driven animations when:**

- The motion needs to react to user interaction (scrolling, hovering, dragging).
- The motion is *part of* the page's layout, not an artifact dropped on top.
- The motion is short: a few seconds of content max.
- The motion is text-heavy and needs to remain selectable/searchable.

**Use video when:**

- The motion is long (>10 seconds).
- The motion involves complex pixel content (generative backgrounds, compositing, particle systems too expensive to run in CSS).
- The motion needs to be embeddable across platforms (social, email, anywhere that doesn't run your CSS).
- You want pixel-exact playback that doesn't depend on the user's device's rendering quirks.

**Use both when:**

- You are building a marketing page with a hero animation that needs to look great loading (scroll-driven CSS) but also needs to be embeddable as a video on social (HyperFrames render of the same composition).
That last one is the pattern we use ourselves. The same CSS that drives the scroll-animated hero on the homepage gets rendered, frame-by-frame, into an MP4 for our Twitter and LinkedIn posts. One composition, two outputs.

## The hybrid pattern: same CSS, two timelines

Here is the thing that makes this beautiful. The CSS animation does not care what timeline drives it. You can bind the same animation to scroll for the web page and to a video timeline for an MP4 render.

The trick is in how HyperFrames evaluates animations. As I covered in [from DOM to MP4](/blog/from-dom-to-mp4), HyperFrames drives animations by manipulating `--hf-time` and animation delays. For scroll-driven animations specifically, HyperFrames provides a `--hf-scroll-progress` CSS variable that the composition can use to *simulate* scroll position deterministically.

A pattern that works:

```css
@keyframes reveal {
  from { opacity: 0; transform: translateY(40px); }
  to { opacity: 1; transform: translateY(0); }
}

.card {
  animation: reveal linear forwards;

  /* Live web page: drive by scroll. */
  animation-timeline: view();
  animation-range: entry 10% cover 30%;

  /* HyperFrames render: drive by --hf-scroll-progress
     (set by the render harness to a synthetic scroll position). */
  @media (--hf-render) {
    animation-timeline: none;
    animation-duration: 1s;
    animation-delay: calc(var(--hf-scroll-progress, 0) * -1s);
    animation-play-state: paused;
  }
}
```

The `(--hf-render)` media query is a custom feature HyperFrames sets when rendering. In a normal browser, the scroll timeline drives the animation. In the render path, the synthetic time variable drives it. Same animation, two timelines.

## A real example: the hyperframes.dev hero

Here is what we actually ship on the homepage, slightly simplified. The hero is a sequence of three "moments" (title, subtitle, demo strip) that scroll into view in sequence.

```html
<section class="hero">
  <h1 class="hero-title">HTML in. MP4 out.</h1>
  <p class="hero-sub">Deterministic video for the agentic web.</p>
  <div class="hero-demo">[demo embed]</div>
</section>

<style>
  .hero { min-height: 200vh; padding-top: 20vh; }
  .hero-title, .hero-sub, .hero-demo {
    animation: rise linear forwards;
    animation-timeline: view(block);
    animation-fill-mode: both;
  }
  .hero-title { animation-range: entry 0% entry 40%; }
  .hero-sub { animation-range: entry 15% entry 55%; }
  .hero-demo { animation-range: entry 30% entry 70%; }
  @keyframes rise {
    from { opacity: 0; transform: translateY(60px); }
    to { opacity: 1; transform: translateY(0); }
  }
</style>
```

The `animation-range` values stagger the elements: the title is already settling as the subtitle starts, which is already in motion as the demo starts. The scroll velocity determines how fast or slow the whole sequence runs. A slow scroll lets the user *read* each element as it arrives; a fast scroll blows through and shows the final state. Both feel intentional.

For the social version of this hero, we render the same composition through HyperFrames at a fixed pace (the harness drives `--hf-scroll-progress` from 0 to 1 over 6 seconds). The output is a 6-second MP4 that we post to Twitter. Same CSS. Same animation. Two outputs.

## The pitfalls I have hit

A short list of things that have bitten me, that I have not seen documented well.

**Pitfall 1: `animation-fill-mode: both` is essential.** Without it, the element snaps back to its initial state when the scroll position is outside the animation range. With it, the element stays at whichever end of the animation is closer.

**Pitfall 2: reduced motion is your responsibility.** Scroll-driven animations are particularly nauseating for users who prefer reduced motion. Wrap your animations in `@media (prefers-reduced-motion: no-preference)` or provide a static fallback. Browsers will not do this for you.
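
A minimal sketch of the reduced-motion guard from Pitfall 2, assuming the `.card` element and `reveal` keyframes from the earlier minimal example. The scroll-driven animation is only declared when the user has not asked for reduced motion; everyone else gets the element in its final, static state.

```css
@keyframes reveal {
  from { opacity: 0; transform: translateY(40px); }
  to   { opacity: 1; transform: translateY(0); }
}

/* Static fallback: no animation declared, so the card is simply visible. */
.card {
  opacity: 1;
  transform: none;
}

/* Opt in to scroll-driven motion only when there is no reduced-motion preference. */
@media (prefers-reduced-motion: no-preference) {
  .card {
    animation: reveal linear both;
    animation-timeline: view();
    animation-range: entry 10% cover 30%;
  }
}
```

Declaring the static state first and layering motion on top means a missing or unsupported media query fails toward the calm version.
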
**Pitfall 3: `view()` timelines are based on the *scroll container's* viewport, not the actual visible viewport.** If you have nested scroll containers, the timeline reference can surprise you. Use `view(block)` to be explicit.

**Pitfall 4: animation events do not fire on scroll-driven timelines.** If you rely on `animationend` to trigger something, it will not fire when the timeline is scroll. Use `IntersectionObserver` for completion signals.

## What I would build next

Scroll-driven animations in 2026 are the same kind of unlock that CSS Grid was in 2018. They take a thing that was previously hard-and-jank (scroll-tied motion) and make it the default. The interesting question is what *new* design patterns this opens up.

A few I have been exploring:

- **Scroll-driven storytelling**: long-scroll pages where the narrative pace is set by the user's scroll. Think *Snow Fall* but built in 40 lines of CSS instead of a custom JS framework.
- **Document-as-video**: a hybrid where the same composition is read as a scrollable page on the web and rendered as a linear video for social. We are dogfooding this pattern; it is changing how we structure our [docs](/docs) and blog posts.
- **Interactive product demos**: scroll-driven walkthroughs that double as embeddable videos for marketing. Build once, distribute everywhere.

If you want to try the hybrid pattern yourself, the [playground](/playground) supports `--hf-scroll-progress` natively now. Drop in a scroll-driven composition, render it, get an MP4. Same input, two outputs. That is the version of the web I want to live in.

---

# Lower thirds and broadcast graphics in pure HTML

URL: https://hyperframes.video/blog/lower-third-broadcast-graphics
Published: 2026-05-16T11:00:00.000Z
Tags: lower-third, broadcast, css, marketing
Author: marcus-okafor

A lower-third is the typographic name-card that overlays the bottom of a video frame. It identifies who is speaking.
It is the single most-reused broadcast asset, the asset most teams overpay for, and the asset most production tools handle worst. It is also four CSS properties and one animation. Here is that version.

## The anatomy

Two lines of text (name on top, role on the bottom) sitting in the lower-left or lower-right of the frame. A thin accent bar to the left. A subtle box behind. That is the entire visual. The motion brings it on with a slide-in and takes it off with a fade.

## The four elements

1. **The name**: heaviest weight, largest size, mixed-case.
2. **The role**: lighter, smaller, often all-caps, tracked-out.
3. **The accent**: a 6px vertical bar in the brand color, sitting flush-left of the type.
4. **The container**: a low-opacity dark box, blurred behind, that lets the type stay legible over any footage.

That is it. Anything else (logos, animations on the role line, sponsored-by tags) is a brand decision that should fight for inclusion.

## The motion

The slide-in is on a [settle curve](/blog/easing-that-looks-like-money). The whole graphic enters from the left, with the accent bar arriving first and the type cascading after.

```css
.lt {
  transform: translateX(-110%);
  transition: transform .9s cubic-bezier(.16, 1, .3, 1);
}
.lt.in { transform: translateX(0); }

.lt .name { opacity: 0; transition: opacity .4s ease .3s; }
.lt.in .name { opacity: 1; }

.lt .role { opacity: 0; transition: opacity .4s ease .5s; }
.lt.in .role { opacity: 1; }
```

The accent bar enters with the container. The name fades in 300ms after the slide starts. The role fades in 200ms after the name. The role's fade (500ms delay plus 400ms duration) finishes just as the 900ms slide settles, so the whole entrance takes 0.9s. Long enough to read, short enough to not annoy.
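
One way to keep those offsets readable is to name them. The sketch below restates the same transitions with custom properties; the variable names (`--lt-slide`, `--lt-fade`, `--lt-name-delay`, `--lt-role-delay`) are illustrative assumptions, not part of any HyperFrames API.

```css
.lt {
  /* Hypothetical timing tokens; values copied from the transitions above. */
  --lt-slide: .9s;
  --lt-fade: .4s;
  --lt-name-delay: .3s;  /* name starts fading 300ms into the slide */
  --lt-role-delay: .5s;  /* role starts 200ms after the name */

  transform: translateX(-110%);
  transition: transform var(--lt-slide) cubic-bezier(.16, 1, .3, 1);
}
.lt.in { transform: translateX(0); }

/* Both text lines start hidden; the tokens are inherited from .lt. */
.lt .name,
.lt .role { opacity: 0; }
.lt .name { transition: opacity var(--lt-fade) ease var(--lt-name-delay); }
.lt .role { transition: opacity var(--lt-fade) ease var(--lt-role-delay); }

.lt.in .name,
.lt.in .role { opacity: 1; }
```

Retiming the cascade then means editing one block of values rather than hunting through selectors.
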
## The knobs Three variables a producer will actually touch: <VariableKnobs html={`<style>body{margin:0;background:#0a0a0a;height:100vh;position:relative;font-family:ui-sans-serif,system-ui;} .lt{position:absolute;bottom:80px;left:80px;display:flex;align-items:stretch;gap:0;} .bar{width:6px;background:{{$ACCENT}};} .box{background:rgba(10,10,10,.78);backdrop-filter:blur(8px);padding:18px 28px;border-left:0;color:white;} .n{font-size:36px;font-weight:800;letter-spacing:-.02em;} .r{font-size:13px;letter-spacing:.3em;text-transform:uppercase;color:rgba(255,255,255,.65);margin-top:6px;}</style> <div class="lt"><div class="bar"></div><div class="box"><div class="n">{{$NAME}}</div><div class="r">{{$ROLE}}</div></div></div>`} knobs={[ { name: "NAME", label: "Name", default: "Marcus Okafor" }, { name: "ROLE", label: "Role", default: "Design Lead, HyperFrames" }, { name: "ACCENT", label: "Brand accent", type: "color", default: "#ff3b1f" } ]} height={280} caption="Three knobs cover most lower-third use cases." /> ## Compare: default versus branded The same lower-third with two different brand systems. Notice how much the accent color does on its own. 
<CompareSlider beforeHtml={`<style>body{margin:0;background:#0a0a0a;height:100vh;position:relative;font-family:ui-sans-serif,system-ui;} .lt{position:absolute;bottom:80px;left:80px;display:flex;} .bar{width:6px;background:#888;} .box{background:rgba(10,10,10,.78);padding:18px 28px;color:white;} .n{font-size:36px;font-weight:800;}.r{font-size:13px;letter-spacing:.3em;text-transform:uppercase;color:rgba(255,255,255,.65);margin-top:6px;}</style> <div class="lt"><div class="bar"></div><div class="box"><div class="n">Marcus Okafor</div><div class="r">Design Lead</div></div></div>`} afterHtml={`<style>body{margin:0;background:#0a0a0a;height:100vh;position:relative;font-family:ui-sans-serif,system-ui;} .lt{position:absolute;bottom:80px;left:80px;display:flex;} .bar{width:6px;background:#ff3b1f;} .box{background:rgba(10,10,10,.78);padding:18px 28px;color:white;} .n{font-size:36px;font-weight:800;}.r{font-size:13px;letter-spacing:.3em;text-transform:uppercase;color:rgba(255,255,255,.65);margin-top:6px;}</style> <div class="lt"><div class="bar"></div><div class="box"><div class="n">Marcus Okafor</div><div class="r">Design Lead, HyperFrames</div></div></div>`} labelBefore="Generic" labelAfter="Branded" caption="A 6px accent bar carries 80% of the brand identity." /> ## The traps Three things that bite first-time lower-third designers: ### 1. Legibility on busy footage A lower-third that looks great over a static dark background falls apart over busy footage. The fix is contrast — either a darker container background (`rgba(10,10,10,.85)` not `.5`) or a subtle drop shadow on the type itself. Test against the worst-case footage you have. If it does not read there, the lower-third is broken. ### 2. The exit animation Most lower-third exits are too slow. The standard is "fade out over 600ms while sliding 30px right." This reads as polite. The better version is "fade out over 250ms in place." Exits should not draw attention. The faster the exit, the less the viewer notices it. 
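
A sketch of the fast in-place exit, assuming the `.lt` and `.in` classes from the motion section plus a hypothetical `.out` class that whatever drives the graphic adds when the card should leave. Keeping `.in` on the element holds the transform at its resting position, so only opacity changes.

```css
/* Hypothetical exit state: .out is added while .in is still present. */
.lt.in.out {
  opacity: 0;
  /* Fade only: 250ms, no positional change. */
  transition: opacity .25s ease-in;
}
```

Fading the container rather than the individual text lines takes the bar, box, and type off together.
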
### 3. Stacking

If two lower-thirds need to appear at the same time (interview with two people), do not stack vertically. Pick one in each corner. Vertical stacks read as crowded; corner-paired reads as a two-shot.

## Production-grade workflow

The path from this template to a broadcast workflow:

1. Build a `lower-third.html` template with placeholders.
2. Maintain a `talent.csv` with `name,role,accent` per person.
3. In CI, render a clean MP4 with alpha (or transparent overlay) per row.
4. Editor drops the right MP4 onto the right shot.

For [marketing teams](/use-cases/marketing), the same template renders the social-cut version with a 9:16 aspect and a larger type scale. One source of truth, multiple outputs.

## Why this beats a tool

A motion graphics plugin will give you a lower-third with twenty knobs. You will use four of them. The other sixteen are surface area for the brand to drift across episodes. A code template with exactly four variables enforces consistency at the API level: a producer cannot ship a lower-third with the wrong color because the wrong color is not on the menu.

This is the underrated value of code-as-design: the system enforces the brand. The [developers guide](/developers) covers wiring the template into your CI; the [HyperFrames render API](/tools/html-to-video) is the underlying infrastructure.

The lower-third is the place to start. If you ship the next ten episodes with a code-driven lower-third, you will start seeing every other broadcast asset the same way.

---

# CSS animated pie chart (and donut): no JavaScript required

URL: https://hyperframes.video/blog/animated-pie-chart-css
Published: 2026-05-16T09:00:00.000Z
Tags: css, svg, charts, tutorial, data-viz
Author: kira-tanaka

Pie charts get a bad reputation from BI dashboards, but in motion graphics they are perfect: a single ratio, big, animated, done. A 5-second video of a donut filling from 0% to 64% communicates "share of voice grew this quarter" better than any line chart.
There are two ways to draw an animated pie chart with zero JavaScript. Both render deterministically to MP4. Here is when to use each. ## Technique 1 — `conic-gradient` (for filled pies) `conic-gradient` paints an angular sweep around a center. For a pie chart, the syntax is delightful: ```css .pie { background: conic-gradient(var(--accent) 0 var(--angle), #1a1a1a 0); } ``` Set `--angle` to `120deg` for a third of the circle, `216deg` for 60%. Animate `--angle` from `0deg` to your target and the slice grows. To make this work in CSS animations (which can't lerp custom properties by default), declare it with `@property`: ```css @property --angle { syntax: '<angle>'; initial-value: 0deg; inherits: false; } ``` Now `--angle` is animatable. <InlineSandbox html={`<!doctype html> <html><head><style> @property --angle { syntax: '<angle>'; initial-value: 0deg; inherits: false; } body{margin:0;background:#0a0a0a;color:#fff;height:100vh;display:grid;place-items:center;font-family:ui-sans-serif,system-ui;} .wrap{display:grid;grid-template-columns:auto auto;gap:48px;align-items:center;} .pie{--angle:0deg;width:220px;height:220px;border-radius:50%;background:conic-gradient(#ff3b1f 0 var(--angle),#1a1a1a 0);animation:fill 4s cubic-bezier(.2,.7,.2,1) infinite alternate;position:relative;} .pie::after{content:"";position:absolute;inset:18%;border-radius:50%;background:#0a0a0a;} @keyframes fill { to { --angle: 230deg; } } .lbl{font:600 12px ui-monospace,monospace;letter-spacing:.2em;text-transform:uppercase;color:rgba(255,255,255,.55);} .val{font:700 56px ui-sans-serif,system-ui;letter-spacing:-.03em;margin-top:8px;} .cap{font-size:14px;color:rgba(255,255,255,.6);margin-top:6px;max-width:200px;} </style></head><body> <div class="wrap"><div class="pie"></div><div><div class="lbl">share of voice</div><div class="val">64%</div><div class="cap">Q2 brand mentions, weighted by reach</div></div></div> </body></html>`} height={320} caption="conic-gradient + @property — pure CSS, no JS." 
/> The donut hole is just an `::after` with the background color, sized to 64% of the parent. Add a label and a value next to it and you have a full chart. ## Technique 2 — SVG `stroke-dashoffset` (for outlined donuts) When you need a stroked ring instead of a filled wedge — the "progress dial" look — use an SVG circle with `stroke-dasharray` set to the circle's circumference and animate `stroke-dashoffset`: <CodeTabs tabs={[ { label: "HTML", lang: "html", code: `<svg width="220" height="220" viewBox="0 0 100 100"> <circle cx="50" cy="50" r="42" fill="none" stroke="#1a1a1a" stroke-width="10"/> <circle cx="50" cy="50" r="42" fill="none" stroke="#ff3b1f" stroke-width="10" stroke-linecap="round" stroke-dasharray="264" stroke-dashoffset="95" transform="rotate(-90 50 50)"/> </svg>`, }, { label: "Math", lang: "txt", code: `circumference = 2 * PI * r = 2 * PI * 42 ≈ 264 For a 64% ring: visible = 264 * 0.64 = 169 hidden = 264 - 169 = 95 stroke-dasharray = 264 stroke-dashoffset = 95 To animate, lerp dashoffset from 264 → 95.`, }, { label: "Live", html: `<!doctype html><html><body style="margin:0;background:#0a0a0a;display:grid;place-items:center;height:100vh;"> <svg width="240" height="240" viewBox="0 0 100 100"> <circle cx="50" cy="50" r="42" fill="none" stroke="#1a1a1a" stroke-width="10"/> <circle cx="50" cy="50" r="42" fill="none" stroke="#ff3b1f" stroke-width="10" stroke-linecap="round" stroke-dasharray="264" transform="rotate(-90 50 50)"> <animate attributeName="stroke-dashoffset" from="264" to="95" dur="2s" fill="freeze" repeatCount="indefinite"/> </circle> </svg> </body></html>`, }, ]} caption="Three views of the same donut: source, the math, and the rendered result." /> The `rotate(-90)` moves the ring's start from 3 o'clock to 12 o'clock — what people expect. `stroke-linecap="round"` softens the leading edge. 
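
The same 264 → 95 lerp can also be written as a CSS animation instead of the SMIL `<animate>` element, which keeps the timing in the stylesheet. A minimal sketch, assuming the foreground circle carries a hypothetical `ring` class; the easing is borrowed from the conic-gradient demo above.

```css
/* .ring is an illustrative class name for the colored foreground circle. */
.ring {
  stroke-dasharray: 264;   /* circumference: 2 * PI * 42, rounded */
  stroke-dashoffset: 264;  /* start fully hidden */
  animation: ring-fill 2s cubic-bezier(.2, .7, .2, 1) both;
}

@keyframes ring-fill {
  to { stroke-dashoffset: 95; }  /* 264 - (264 * 0.64), rounded: a 64% ring */
}
```

Because CSS declarations take precedence over SVG presentation attributes, the `stroke-dashoffset` attribute in the markup can stay as a no-CSS fallback.
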
## Multi-segment pies

For a pie with multiple slices (a real share-of-X chart), stack `conic-gradient` color stops:

```css
background: conic-gradient(
  var(--brand-a) 0 30%,
  var(--brand-b) 30% 55%,
  var(--brand-c) 55% 78%,
  #2a2a2a 78% 100%
);
```

To animate this from zero, animate each stop's end value with separate custom properties. It's verbose but mechanical, and for a fixed number of slices (usually 3–5), the verbosity is fine.

## Color systems for pie charts

A multi-segment pie lives or dies by its color palette. The rule that works:

1. **One brand color at full saturation** for the headline slice.
2. **Two desaturated neighbors** for context slices.
3. **A neutral gray** for "other."

Avoid the BI-tool palette of seven hue-shifted primaries. Three colors plus a gray reads as designed; seven reads as a spreadsheet.

## Rendering to MP4

Both techniques are pure CSS/SVG, so they render through the [HyperFrames pipeline](/tools/html-to-video) without any special handling. The conic-gradient version is slightly cheaper to rasterize; the SVG version composites better against textured backgrounds.

If you're targeting Instagram (1080×1080), make the pie 60% of the canvas width and put the label to the right. If you're targeting Reels (1080×1920), stack them vertically. The template is the same; the layout changes per ratio.

## When you actually need Chart.js

Pie charts with live tooltips, drill-down click handlers, or thirty-plus segments are not motion-graphics problems; they are dashboard problems. Use a real chart library. But for the one ratio you want to launch a brand campaign around, sixty lines of CSS is the right amount of code.

Open the [playground](/playground), drop the donut example in, ship the video.

---

# Generative motion design: LLMs writing CSS animations

URL: https://hyperframes.video/blog/llms-writing-css-animations
Published: 2026-05-15T20:00:00.000Z
Tags: design, ai, llms, css, motion
Author: marcus-okafor

I had a small, dispiriting moment last fall. A client asked if they could "just have the AI do the motion." I made the right professional noises, but I went home that night and ran the test myself: I gave the same brief to three different LLMs and graded the output the way I would grade a junior designer's first attempt. The result was more interesting than I expected. The models were not as bad as I had hoped (this is honest), and not nearly as good as the client thought (this is also honest).

This post is a longer version of that experiment. I have spent the last six months poking at how well frontier LLMs produce motion design (CSS keyframes, easing curves, timing, composition) and I want to share what I have found.

The TLDR: models in 2026 can write *competent* animation. They cannot, yet, write *tasteful* animation. The gap is small in lines of code and very large in feeling. The good news is that the gap is closable with the right prompt.

## The test rig

I asked four models (pick your current frontier favorites) to produce a 4-second CSS animation of a single headline word "ARRIVES" with subtle character stagger. The brief specified 1080p, 60fps, white text on near-black background, no JavaScript, brand-appropriate easing, "should feel premium." I rendered each one through HyperFrames (deterministic, so the model's output is what you see, no encoder noise) and graded.

The grades, on my private scale of 1-10:

- Model A: 7/10. Good defaults, sensible easing, one weird overshoot.
- Model B: 6/10. Technically correct, visually generic, easing was `ease-in-out` on everything.
- Model C: 8/10. Surprisingly tasteful default. Used a custom cubic-bezier I would have used.
- Model D: 5/10. Worked, but felt like 2017.

These numbers are not a benchmark.
They are one designer's eye on one brief. But the *patterns* in the failures are stable enough that I want to spend the rest of the post on them.

## Failure mode 1: ease-in-out everywhere

The single most common LLM motion failure is using `ease-in-out` (or worse, `ease`) for every transition. The browser default, the most commonly-seen value in training data, the easing of least imagination.

I wrote a whole post about why this is the wrong default in [easing that looks like money](/blog/easing-that-looks-like-money), so I will not relitigate. But for LLMs specifically, the fix is simple: name a curve in the prompt.

A prompt like "use a settle easing of `cubic-bezier(.16, 1, .3, 1)` for elements arriving, and `cubic-bezier(.7, 0, .84, 0)` for elements departing" lifts the median output a full grade point. The model is happy to use specific curves; it just defaults to generic ones when you do not specify.

## Failure mode 2: overshoot mismatch

The second most common failure is overshoot misplaced. The model will, unprompted, sprinkle `cubic-bezier(.34, 1.56, .64, 1)` (a bouncy overshoot) on every element. Or it will use overshoot on the wrong element: on a body subtitle instead of the headline.

The issue here is one of editorial judgment. Overshoot is a *choice* you make about *which* element gets the dramatic moment. A model has no concept of "which element matters most"; it sees all elements as roughly equal candidates for the dramatic treatment.

The fix is to be explicit: "the word ARRIVES is the hero. It gets overshoot. Everything else uses the settle curve." The model will respect this hierarchy if you state it.

## Failure mode 3: timing that does not breathe

Models love to pack animation into 300ms. I think this is because most CSS animation examples in training data are micro-interactions (hover states, button presses), which are correctly short.
For *editorial* motion (title reveals, cinematic title cards, hero animations) the duration that feels right is more like 800-1200ms for the lead element, with 60-100ms of stagger between characters or sub-elements.

Asked unprompted, models will write `animation-duration: 0.4s` for everything. Asked with the constraint "this is editorial motion, durations should be in the 700-1200ms range," the output gets noticeably better.

A trick that works particularly well: give the model a *budget* rather than a target. "The hero word should arrive over 900ms; the staggered characters should span the first 600ms; the final 300ms is the settle." The budget framing maps to how human designers think about timing, and the model picks it up.

## Failure mode 4: forgetting `animation-fill-mode: both`

A specific technical failure that costs grades. CSS animations default to `animation-fill-mode: none`, which means the element returns to its un-animated state after the animation ends. For static frames at the end of a render (which is most editorial motion) you want `forwards` or `both`.

LLMs omit this maybe 70% of the time. The fix in prompts is: "Set `animation-fill-mode: both` on all animations." That is the literal sentence I add. It works.

## Failure mode 5: no stagger, or wrong stagger

Models are weirdly bad at character or element staggers. They will either (a) animate everything simultaneously, which is dull, or (b) compute the stagger as a JavaScript loop with `setTimeout`, which violates the no-JS constraint I set.

The right pattern in 2026 CSS is to use `animation-delay` with an index variable. Pure CSS, no JS:

```css
.word span {
  display: inline-block;
  animation: arrive 900ms var(--settle) both;
  animation-delay: calc(var(--i) * 60ms);
}
.word span:nth-child(1) { --i: 0; }
.word span:nth-child(2) { --i: 1; }
.word span:nth-child(3) { --i: 2; }
/* ... */
```

Show the model this pattern once, in the prompt, and it will apply it correctly from then on.
Do not show it the pattern, and it will reach for JavaScript.

## What the prompt that fixed everything looks like

After enough iterations I converged on a "motion brief" prompt template that consistently produces 8-9/10 results. It is long but mostly a list of constraints:

```
Produce a CSS-only animation for the headline word "ARRIVES" rendered at 1920×1080, 60fps, intended for HyperFrames deterministic render.

Constraints:
- Editorial motion, not UI motion. Durations 700-1200ms range.
- One hero moment. The headline word gets `cubic-bezier(.7, -.5, .4, 1.4)`.
- Everything else uses the settle: `cubic-bezier(.16, 1, .3, 1)`.
- Character stagger of 60ms between characters, via animation-delay.
- animation-fill-mode: both on all animations.
- No JavaScript. No web fonts. No external resources.
- Background is #0a0a0a. Text is #f4f4f4. One accent in #ff5a5f if needed.
- The animation should resolve by t=2.5s, then hold.
```

That prompt, against current frontier models, produces output I would ship in a portfolio piece, with maybe one easing tweak. The model is doing the typing; I am doing the judgment.

## The "easing taste gap" is real but narrow

A finding that surprised me: the gap between "model output" and "designer output" is not in *creativity* or in *technique*; it is almost entirely in *easing*. Two animations with identical keyframes and identical timing will feel completely different depending on the curve. The model gets the keyframes and timing roughly right; it gets the curve wrong in a specific, identifiable way.

This is good news for the field. It means the gap is closable not by waiting for better models, but by encoding the missing taste in tooling. We do this in HyperFrames by shipping a small library of named curves you can reference in MDX or HTML: `--ease-settle`, `--ease-launch`, `--ease-anticipate`, `--ease-whisper`. Models pick up named curves more reliably than four-number tuples. The cognitive load is lower.
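To make the "name the curves" idea concrete, here is a minimal sketch of how a named-curve table might be turned into CSS custom properties. It is not the HyperFrames library. The three tuples are the ones quoted earlier in this post, but which name maps to which tuple is my assumption, and `--ease-whisper` is left out because no value for it appears here. `toCustomProperties` and `isValidCubicBezier` are hypothetical helper names.

```typescript
// A cubic-bezier tuple: (x1, y1, x2, y2).
type Bezier = readonly [number, number, number, number];

// ASSUMPTION: the name-to-tuple mapping below is illustrative only.
const namedCurves: ReadonlyArray<readonly [string, Bezier]> = [
  ["--ease-settle", [0.16, 1, 0.3, 1]],
  ["--ease-launch", [0.7, 0, 0.84, 0]],
  ["--ease-anticipate", [0.7, -0.5, 0.4, 1.4]],
];

// CSS requires both x control points to lie in [0, 1]; y values may overshoot.
function isValidCubicBezier(curve: Bezier): boolean {
  return curve[0] >= 0 && curve[0] <= 1 && curve[2] >= 0 && curve[2] <= 1;
}

// Emit a :root block a model can reference by name instead of by tuple.
function toCustomProperties(
  curves: ReadonlyArray<readonly [string, Bezier]>
): string {
  const lines: string[] = [];
  for (const [name, curve] of curves) {
    if (!isValidCubicBezier(curve)) {
      throw new Error(`invalid cubic-bezier for ${name}`);
    }
    lines.push(`  ${name}: cubic-bezier(${curve.join(", ")});`);
  }
  return `:root {\n${lines.join("\n")}\n}`;
}

console.log(toCustomProperties(namedCurves));
```

The point of the validation step is that a model asked to "invent a curve" will sometimes push an x value outside 0-1, which browsers reject; a named table catches that once, at build time, rather than in every prompt.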
If you are building any kind of LLM-assisted animation tool, this is the lever I would pull first: name the curves, document them well, prompt the model to use them by name. The output quality jumps. ## What models still cannot do Honesty section. There are things current LLMs cannot do well in motion, and I want to name them. - **Composition.** "Make a hero shot for a fintech ad" produces something that looks like an LLM made it. Not bad; not distinct. The compositional choices — what is foregrounded, what is held back, what *moves* and what stays still — are where models still feel generic. - **Brand voice.** A motion that "feels like Stripe" vs one that "feels like Square" requires the model to have internalized the brand. Some can, with explicit reference; most cannot, without. - **Pacing across longer sequences.** A 4-second animation is easy. A 30-second sequence with multiple beats, breathing room, and rhythm is hard. Models struggle to hold an arc. The first two are likely closable in the next year as models get better at style mimicry. The third one I am less sure about — pacing is editorial, and editorial is the slow part. ## How HyperFrames fits The reason we care about this at HyperFrames is simple: when an LLM writes the animation and HyperFrames renders it, the loop is tight. The model writes HTML. The render is deterministic. The reviewer (human or LLM) sees exactly what the model produced. Iteration is possible. Compare this to "model writes a prompt, prompt goes to Sora, Sora produces something different each time." The loop is broken. The model cannot reliably tell whether its change helped, because the renderer added noise. We wrote about this in [why agents need deterministic rendering](/blog/ai-agents-need-deterministic-rendering). 
The practical workflow that has emerged in our own team: a model writes the first draft of an animation, we render it deterministically in the [playground](/playground), we look at the result, and we either accept it or write a single-sentence note for the model to revise. The note is almost always about easing. Almost always. ## Where this goes My current bet is that within two years, an LLM-plus-deterministic-renderer pipeline will produce production-quality editorial motion for the majority of marketing content. The pieces are all there: the models can write the HTML, the renderer can produce the MP4, the only missing piece is the layer of taste that picks the right curve. That layer is mostly tooling and prompting, not model capability. The pieces *I* care about, as a designer, are the ones above that layer: composition, brand voice, pacing. Those will remain human work for longer. Which is, to be honest, the way I want it. If you are running these experiments yourself, the [playground](/playground) is free and deterministic — paste the model's output, render it, judge it. That is the whole workflow. Bring better prompts than I did and you will get better results. --- # Why AI agents need deterministic rendering primitives URL: https://hyperframes.video/blog/ai-agents-need-deterministic-rendering Published: 2026-05-15T13:00:00.000Z Tags: ai, agents, determinism, strategy Author: hf-team We have a thesis about agents and we have been refining it for two years. Here it is in one sentence: the rate-limiting step in agentic systems is not the model, it is the *feedback signal*. Models are smart. Loops are not, unless the thing they loop against gives them a clean comparable answer. Most of the visible work on agents in 2024 and 2025 went into the models. Better tool use, better planning, longer context. The work that gets less attention, but matters at least as much, is the work on what the agent looks at while it iterates. That work is about determinism. 
This post is about why deterministic rendering — same HTML, same MP4, byte-identical, every time — is one of the small set of primitives that an agentic video stack needs. It is also about what an agent-friendly render API looks like in practice, which is mostly the opposite of what a human-friendly render API looks like. ## The agent is a loop A useful frame, before we get to video. An agent, stripped down, is: 1. Observe state. 2. Compare to goal. 3. Take an action. 4. Observe new state. 5. Did the action move toward the goal? Repeat. This is, recognizably, the same shape as a control system. The literature on control systems has a lot to say about what makes this loop converge or diverge, and the most important variable, by a wide margin, is the *noise on the observation*. If you cannot reliably tell what your action did, you cannot reliably improve. Generative models — the same ones we praise as the future of video — produce noisy observations *by construction*. The output is sampled. Run the same prompt twice and you get two different videos. For a human authoring a hero shot, this is a feature; for an agent comparing draft *n* to draft *n+1*, it is the worst possible property. ## Snapshot testing is a 30-year-old idea The closest analog to what agents need from video is what software developers have used for decades: snapshot tests. You serialize your output, check it in, and assert that future runs produce the same serialization. When something changes, you see the diff, decide if you meant it, and update the snapshot. Snapshot testing only works if the output is deterministic. If your function returns a different result every time, the snapshot is noise and the test is worse than nothing. The same applies to video. If `render(html)` produces a different MP4 every call, you cannot snapshot it. If it produces the same MP4 every call, you can. We use this directly in HyperFrames CI. 
Every blog header animation, every demo embed, every showcase render, has a snapshot. When we change the renderer, we run the suite and look at the diffs. A diff means either a regression (revert) or an improvement (re-bless the snapshots and ship). There is no third state called "well, it usually looks fine."

Agents need the same thing. An agent improving a video composition needs to be able to:

1. Render version *n*.
2. Take some action: change a CSS variable, swap a font, move an element.
3. Render version *n+1*.
4. Diff. Decide. Iterate.

If the diff between *n* and *n* (no changes) is nonzero, the entire loop is poisoned. The first thing a serious agentic system needs from its renderer is the guarantee that *no diff means no change*.

## What "deterministic" actually has to mean

A common pushback: "Most renderers are *mostly* deterministic. Isn't that good enough?"

No. There is a specific bar, and most renderers do not clear it. We laid out the full taxonomy in [the deterministic video manifesto](/blog/deterministic-video-manifesto), but the short version is that "deterministic" for an agent has to mean:

- **Bit-identical across runs on the same machine.** Same input, same binary, same bytes out.
- **Bit-identical across machines with the same engine version.** This is harder. It requires pinning the renderer's dependencies (Chromium version, encoder build, font set) and freezing all sources of nondeterminism (the system clock, random seeds, hardware encoder quirks).
- **Visually identical across encoders.** A weaker bar: bytes may differ between, say, an Intel and an ARM build, but the pixel content matches within an imperceptible tolerance.

Most off-the-shelf rendering tools clear the first bar and fail the second. Headless Chromium drives a video that *looks* the same on two machines but encodes to slightly different bytes because of GPU rasterizer differences. For human-authored content, this is fine. For an agent's feedback loop, it is the failure mode.
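The first bar (bit-identical across runs) is easy to check mechanically. What follows is a minimal sketch, not our CI harness: `Render` is a hypothetical signature standing in for whatever renderer you call, and `firstDifference` and `isByteStable` are illustrative names.

```typescript
// ASSUMPTION: a renderer that takes HTML and returns the encoded MP4 bytes.
type Render = (html: string) => Uint8Array;

// Offset of the first differing byte, or -1 when the buffers are identical.
function firstDifference(a: Uint8Array, b: Uint8Array): number {
  const shared = Math.min(a.length, b.length);
  for (let i = 0; i < shared; i++) {
    if (a[i] !== b[i]) return i;
  }
  // Same prefix: identical only if the lengths also match.
  return a.length === b.length ? -1 : shared;
}

// Render the same input several times and require a zero diff every time.
function isByteStable(render: Render, html: string, runs: number = 3): boolean {
  const baseline = render(html);
  for (let i = 1; i < runs; i++) {
    if (firstDifference(baseline, render(html)) !== -1) return false;
  }
  return true;
}
```

Run this against version *n* before the agent is allowed to compare *n* with *n+1*. If `isByteStable` returns false, any diff the agent sees afterwards is noise, and the byte offset from `firstDifference` is a reasonable place to start looking for the nondeterministic field.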
## What an agent-friendly render API looks like We have spent two years iterating on this — first by watching agents fail to use our API, then by adding the things they needed. Some observations. **1. Synchronous, errors-as-values.** Agents do not handle exceptions well. They do handle structured returns well. Our render function returns `{ ok: true, path, manifest }` or `{ ok: false, errors: [...] }`. The errors are structured, with codes the agent can reason about (`fonts-not-loaded`, `unsupported-property`, `timeout-in-hf-ready`). They are not stack traces. **2. Deterministic content hashes.** Every render emits a `compositionHash` that is a function of the inputs alone. If two renders produce the same hash, the agent does not need to look at the output; it knows the output. We cache aggressively on this. **3. A small, total API surface.** The renderer's contract is "HTML in, MP4 out." Not "HTML in, optionally some JSON config, optionally some font overrides, optionally a callback, optionally a..." Every optional knob is a thing the agent has to learn. We push configuration into the HTML (as `<meta>` tags or CSS variables) precisely because the model already speaks HTML fluently. **4. Frame-pinned timing.** No `setTimeout`, no `requestAnimationFrame` driving state. Animations are functions of `--hf-time`. This means the agent can reason about what's on screen at second 3 by reading the CSS, not by simulating the JS event loop. We get into this in more depth in [frame-accurate timing in the browser](/blog/frame-accurate-timing-browser-2026). **5. Inspectable intermediate state.** When a render fails, the agent gets the *last good frame* and a description of what went wrong, not just a "render failed." This is the single most valuable affordance for agents iterating on layout. The agent can look at the broken state and reason about the fix. **6. Cheap to run.** A render budget of 1-2 seconds is the bar. 
An agent that has to wait 30 seconds per iteration gives up after three tries. An agent that gets feedback in 1.5 seconds will iterate fifty times in a minute. This is not a UX nicety; it is the difference between an agent that converges and one that does not. ## The "agent OS" framing A frame we have started using internally. The 2026 stack for an agent doing complex visual work needs primitives the way a process on an OS needs syscalls. A few of those primitives: - **Read the world**: web search, web fetch, file read. - **Write the world**: file write, API call, code execution. - **Render the world**: produce visual artifacts (charts, video, images) deterministically. - **Evaluate the world**: compare outputs, score against rubrics, decide. Render and evaluate are the two that have lagged. Read and write are commoditized — every major model provider ships them now. Render-the-world is what HyperFrames is. Evaluate-the-world is what visual diff tools and the various scoring models are starting to be. The agent OS is not a metaphor in the philosophical sense; it is a real engineering target. The same way a kernel exposes `read`, `write`, `open` to a process and lets the process not care how the disk works, the agent OS exposes `render`, `evaluate`, `compose` and lets the agent not care how the renderer works. Determinism is the property that makes those primitives composable. If `render` is nondeterministic, `evaluate` becomes statistical and `compose` becomes lossy. The whole stack degrades. ## What we built, and what we changed because of agents The HyperFrames API was originally designed for humans. The early users were motion designers and developers iterating on hero animations. Around mid-2024 we started seeing requests from agentic systems, and the requests had a different shape: - "Can I get the render manifest as a JSON header so I do not have to parse the binary?" - "Can the CLI exit nonzero when fonts fall back, instead of silently substituting?" 
- "Can I get a list of every CSS rule that did not match anything?" We said yes to all of these, because the feedback they wanted was the feedback that made *human* debugging easier too. Agents are, in our experience, an unusually effective stress test for API ergonomics. Every change we made for agents made the human-facing CLI better. The one place we have not yet caved: we will not add a "creative" mode that introduces nondeterminism in exchange for more interesting output. That is what generative video is for. Our value is the opposite: same input, same output, every time. We covered this tradeoff in more detail in [the AI video landscape](/blog/ai-video-landscape-2026). ## What the next year looks like A few predictions, lightly held. First, deterministic rendering becomes a baseline expectation in agentic stacks, the same way "structured outputs" became a baseline expectation for LLMs in 2024. Tools that ship nondeterministic output will be marked as "for human use only." Second, the visual-evaluation half of the agent OS will get its own renaissance. The current frontier — VLMs judging "is this video good?" — is too coarse and too slow. We expect cheaper, faster, more specific evaluators (does the headline match? is the chart axis correct? does the color match the brand kit?) to ship. Third, the gap between human-facing video tools and agent-facing video tools will widen. Human tools optimize for expression. Agent tools optimize for predictability. The same way that human-facing programming languages diverged from machine-facing instruction sets, video tools will split into "for designers" and "for agents." We are betting on the second one. If you are building agents that touch video and have not yet hit the determinism wall, you will. When you do, the [playground](/playground) is the fastest way to play with a deterministic renderer, and the [developer docs](/developers) cover the agent-facing surface. Come find us. 
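A postscript for readers who want the errors-as-values contract from the API section in concrete form. This is a sketch under assumptions, not the published HyperFrames types: the three error codes and the `ok` / `path` / `manifest` / `errors` fields are quoted from the post, while the `message` field, the shape of `manifest`, and the `nextAction` policy are hypothetical.

```typescript
// Error codes quoted in the post; the list is not claimed to be complete.
type RenderErrorCode =
  | "fonts-not-loaded"
  | "unsupported-property"
  | "timeout-in-hf-ready";

// ASSUMPTION: each error carries a code plus a human-readable message.
interface RenderError {
  code: RenderErrorCode;
  message: string;
}

// ASSUMPTION: the manifest exposes at least the composition hash.
type RenderResult =
  | { ok: true; path: string; manifest: { compositionHash: string } }
  | { ok: false; errors: RenderError[] };

// Hypothetical agent policy: branch on structured codes, never on stack traces.
function nextAction(result: RenderResult): string {
  if (result.ok === true) {
    return `diff ${result.path}`;
  }
  const codes = result.errors.map((e) => e.code);
  if (codes.indexOf("fonts-not-loaded") !== -1) return "fix font references";
  if (codes.indexOf("unsupported-property") !== -1) return "remove unsupported CSS";
  return "simplify composition and retry";
}
```

The useful property is that every branch is total: the agent never has to parse prose to decide what to try next.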

---

# Wipe transitions and masked reveals (underused CSS tricks)

URL: https://hyperframes.video/blog/wipe-transitions-clip-path
Published: 2026-05-15T10:30:00.000Z
Tags: css, transitions, clip-path, tutorial
Author: ren-park

`clip-path` is the most expressive CSS property nobody learns in their first three years on the platform. It can do every transition you remember from television (the diagonal wipe, the iris-in, the bar-wipe) in two lines of CSS. No JS, no library, no framework.

Here are six clip-path wipes I use in production, the timing each needs, and when to reach for which.

## What clip-path actually does

`clip-path` defines a region of an element that is visible. Anything outside the region is invisible (not removed, just not rendered). The region can be an inset, a circle, a polygon, an ellipse, or an SVG path.

Animate the *region*, and the element appears to wipe in or out. Because animating `clip-path` does not trigger layout (only paint and compositing), it holds 60fps on most devices that matter.

## Wipe 1: linear (left-to-right)

The default. The content reveals from left to right.

```css
.front {
  clip-path: inset(0 100% 0 0);
  transition: clip-path 1s ease-out;
}
.front.in {
  clip-path: inset(0 0 0 0);
}
```

When to use: most of the time. The linear wipe is invisible if you do not pay attention to it; that is often the point.

## Wipe 2: diagonal

A polygon that sweeps in at an angle. Reads as more energetic than linear.

```css
clip-path: polygon(0 0, 100% 0, 50% 100%, 0 100%);
```

Animate the x-coordinates of the polygon's two right-edge points together, keeping them 50% apart, so the top point travels from 0 to 150% and the bottom point from -50% to 100%. The snippet above is the moment the top point passes 100%. The angle stays constant; only the extent changes.

When to use: B-roll between two action shots, sports content, anything that wants "momentum."

## Wipe 3: iris

A shrinking circle. The classic "ending" wipe, used at the end of Looney Tunes cartoons for exactly this reason.

```css
/* a centered circle needs a radius of roughly 71% to reach the corners */
clip-path: circle(75% at 50% 50%);
/* animate to: */
clip-path: circle(0% at 50% 50%);
```

When to use: end of sequences, final reveals, jokes with a "punchline" structure.
## Wipe 4: bar wipes (venetian blinds)

Multiple parallel strips that wipe in alternating directions. Cinema's "broadcast-style" transition.

```css
clip-path: polygon(
  0% 0%, var(--w) 0%, var(--w) 16.6%, 0% 16.6%,
  100% 16.6%, calc(100% - var(--w)) 16.6%, calc(100% - var(--w)) 33.3%, 100% 33.3%,
  /* ... continues for each strip */
);
```

When to use: title sequences, broadcast-style intros, anything that wants to *announce* the transition.

## Wipe 5: vertical reveal

Like the linear wipe, but top-to-bottom. Smaller in effect, larger in elegance.

```css
/* inset() takes top right bottom left: clip everything from the bottom edge up */
clip-path: inset(0 0 100% 0);
/* animate to: */
clip-path: inset(0 0 0 0);
/* starting from inset(100% 0 0 0) instead reveals bottom-to-top */
```

When to use: text reveals, lower-thirds appearing, any content that should feel like it is "growing" into the frame.

## Wipe 6: circle-out

The inverse of iris. A circle that grows from the center to fill the frame.

```css
clip-path: circle(0% at 50% 50%);
/* animate to: */
clip-path: circle(75% at 50% 50%);
```

When to use: opening reveals, "big idea" hero shots, anything where you want to imply the content is *bursting* into view.
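The diagonal wipe (Wipe 2) is the one people most often animate incorrectly, because both right-edge points have to move together. Here is a minimal sketch that computes the polygon for any point in the wipe, so the clip region is a pure function of progress. `diagonalWipe`, `frameToProgress` and the `slant` parameter are hypothetical names of my own, not part of any library.

```typescript
// Round to two decimals so the same progress always yields the same string.
const round2 = (x: number): number => Math.round(x * 100) / 100;

// progress: 0 (fully hidden) to 1 (fully revealed).
// slant: horizontal offset, in percent, between the top and bottom of the edge.
function diagonalWipe(progress: number, slant: number = 50): string {
  const p = Math.min(1, Math.max(0, progress));
  // The top point must overshoot to 100% + slant so the bottom point reaches 100%.
  const topX = round2(p * (100 + slant));
  const bottomX = round2(topX - slant);
  return `polygon(0 0, ${topX}% 0, ${bottomX}% 100%, 0 100%)`;
}

// Map a frame index to linear progress for a wipe of the given duration.
function frameToProgress(frame: number, fps: number, durationMs: number): number {
  return ((frame / fps) * 1000) / durationMs;
}

// Frame 30 of a 60fps, 1000ms wipe is the halfway point.
console.log(diagonalWipe(frameToProgress(30, 60, 1000)));
```

In plain CSS you would express the same thing as two keyframes (the polygons for progress 0 and progress 1) and let the browser interpolate; the function form is handy when you want to inspect or snapshot a specific frame. Easing is deliberately left out here and would be applied to `progress` before the call.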
## The CodeTabs view

The complete CSS and the result side by side:

<CodeTabs
  tabs={[
    { label: "CSS", code: `/* Wipe 1: linear */
.linear {
  clip-path: inset(0 100% 0 0);
  transition: clip-path 1.2s cubic-bezier(.16, 1, .3, 1);
}
.linear.in { clip-path: inset(0 0 0 0); }

/* Wipe 3: iris */
.iris {
  clip-path: circle(75% at 50% 50%);
  transition: clip-path 1.2s ease-in-out;
}
.iris.in { clip-path: circle(0% at 50% 50%); }

/* Wipe 6: circle-out */
.circle-out {
  clip-path: circle(0% at 50% 50%);
  transition: clip-path 1.2s cubic-bezier(.7, 0, .3, 1);
}
.circle-out.in { clip-path: circle(75% at 50% 50%); }` },
    { label: "HTML", code: `<div class="front linear">FROM</div>
<div class="back">TO</div>
<button onclick="document.querySelector('.front').classList.toggle('in')">Wipe</button>` },
    { label: "Result", html: `<style>body{margin:0;background:#0a0a0a;height:100vh;font-family:ui-sans-serif,system-ui;}
.stage{position:relative;height:100%;display:grid;place-items:center;}
.front,.back{position:absolute;inset:0;display:grid;place-items:center;font-size:120px;font-weight:900;color:white;}
.back{background:#ff3b1f;}
.front{background:#1f8a5b;clip-path:inset(0 0% 0 0);animation:wipe 3s ease-in-out infinite alternate;}
@keyframes wipe{from{clip-path:inset(0 0 0 0);}to{clip-path:inset(0 100% 0 0);}}
</style>
<div class="stage"><div class="back">AFTER</div><div class="front">BEFORE</div></div>` }
  ]}
  caption="The HTML and CSS that drive a linear wipe."
/>

## Timing rules

A wipe is its timing. Three rules I keep returning to:

1. **600-1000ms** for entrance wipes. Faster reads as a cut; slower reads as filler.
2. **300-500ms** for exit wipes. Exits should always be faster than entrances.
3. **Pair every wipe with motion in the incoming content.** A wipe that reveals static content feels half-finished. As the wipe arrives, the content within it should have a small motion of its own (scale 0.95 → 1, opacity 0.8 → 1).

## Compare: with and without a wipe

The same cut, with and without a wipe between shots.
The difference is small in any single moment; large across thirty seconds of content. ## Render to MP4 Same drill: open in the [playground](/playground), set the duration to a multiple of your wipe cycle, render. The output is a deterministic MP4 with frame-perfect wipe timing. `clip-path` is a quiet superpower. Two CSS properties, six transitions, no library. If you are still reaching for video editing tools to do basic wipes between HTML scenes, this is the swap to make this week. See also [motion graphics in 80 lines](/blog/motion-graphics-in-80-lines) for more on what code-driven transitions unlock. --- # AV1 vs H.264 vs H.265 for web video in 2026: the encoder showdown URL: https://hyperframes.video/blog/av1-h264-h265-web-video-2026 Published: 2026-05-14T20:00:00.000Z Tags: engineering, codecs, av1, h264, h265 Author: kira-tanaka I have been writing some version of "which codec should you ship" since 2019. It was easy in 2019 — the answer was H.264 and the only nuance was the profile. It got harder around 2022 when AV1 hardware decode started landing in consumer devices. It is hardest now, in 2026, because *all three* of H.264, H.265, and AV1 are viable, and the right answer genuinely depends on your audience, your tooling, and your willingness to ship multiple ladders. This post is the version of that explanation I find myself emailing every two weeks. I am writing it down so I can link it. ## Decoder support, as of May 2026 The single most important table in any codec post. Decoder support determines what you can ship; everything else is optimization. 
| Codec | Chrome | Edge | Safari | Firefox | iOS Safari | Android | Smart TV (avg) |
|---|---|---|---|---|---|---|---|
| H.264 | yes (HW) | yes (HW) | yes (HW) | yes (HW) | yes (HW) | yes (HW) | yes |
| H.265 | yes (HW)¹ | yes (HW) | yes (HW) | yes (SW, 2025+) | yes (HW) | mixed | mostly |
| AV1 | yes (HW²) | yes (HW²) | yes (HW, M3+/A17+) | yes (HW²) | yes (A17+) | mixed (Pixel 6+, S22+) | mixed |

¹ Chrome H.265 decode is hardware-only and requires a system codec. On Linux without the right install, it fails.

² AV1 HW decode requires a recent GPU, roughly late 2020 or newer depending on the vendor (RTX 30+, Intel Arc, Apple M3+, AMD RDNA 2+). Software decode is universal in modern browsers but expensive at 4K.

The shape of this table changed dramatically in the last two years. AV1 went from "Chrome and Firefox" to "everywhere, with hardware acceleration on anything bought after 2022." H.265 finally landed in Firefox in 2025, ending its long exile.

## Bitrate vs quality, on real content

The interesting question is not "what is the maximum quality"; it is "at the bitrate you actually want to ship, how does each codec look."

We run our internal benchmarks against three reference clips: a chart wipe (high contrast, hard edges), an editorial title sequence (subtle gradients, motion blur), and a product demo (UI elements + sweeping camera moves). These are VMAF scores at 1080p60, encoded with the best widely-available encoder per codec (libx264 `--preset slow`, x265 `--preset slow`, SVT-AV1 `--preset 6`). Lower numbers are worse.

| Bitrate | H.264 | H.265 | AV1 |
|---|---|---|---|
| 2 Mbps | 79.1 | 86.4 | 89.2 |
| 4 Mbps | 87.6 | 91.8 | 93.7 |
| 8 Mbps | 92.4 | 94.6 | 95.4 |
| 16 Mbps | 95.1 | 96.0 | 96.4 |

A few observations. AV1's advantage is largest at low bitrates: at 2 Mbps it is roughly 10 VMAF points ahead of H.264, which translates to "obviously cleaner" on side-by-side viewing. At 16 Mbps, all three look essentially identical; you are well past the point where the codec matters.
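If you want to see how quickly the gap closes, the arithmetic is small enough to script. This sketch only restates the VMAF table above and subtracts the H.264 column; `gainOverH264` and `round1` are hypothetical helper names, and nothing here is new measurement data.

```typescript
interface VmafRow {
  mbps: number;
  h264: number;
  h265: number;
  av1: number;
}

// Scores copied from the table above (1080p60).
const vmafAt1080p60: VmafRow[] = [
  { mbps: 2, h264: 79.1, h265: 86.4, av1: 89.2 },
  { mbps: 4, h264: 87.6, h265: 91.8, av1: 93.7 },
  { mbps: 8, h264: 92.4, h265: 94.6, av1: 95.4 },
  { mbps: 16, h264: 95.1, h265: 96.0, av1: 96.4 },
];

// Round to one decimal to hide floating-point residue from the subtraction.
const round1 = (x: number): number => Math.round(x * 10) / 10;

// VMAF points gained over H.264 at each bitrate.
function gainOverH264(rows: VmafRow[]) {
  return rows.map((r) => ({
    mbps: r.mbps,
    h265: round1(r.h265 - r.h264),
    av1: round1(r.av1 - r.h264),
  }));
}

for (const g of gainOverH264(vmafAt1080p60)) {
  console.log(`${g.mbps} Mbps: H.265 +${g.h265}, AV1 +${g.av1}`);
}
```

AV1's lead over H.264 shrinks from 10.1 points at 2 Mbps to 6.1, 3.0 and finally 1.3 at 16 Mbps, which is the "largest at low bitrates" claim in numbers.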
The practical implication: AV1 is the right call for bandwidth-constrained delivery (mobile, low-tier streaming). H.264 is still fine for desktop and broadband. H.265 is the awkward middle child: better than H.264, worse than AV1, and a licensing minefield.

## Encoder availability and speed

A codec is only useful if you can produce it at the speed you can afford. Speed numbers from our CI runner (16-core Xeon, no GPU), encoding a 10-second 1080p60 clip at 8 Mbps:

| Encoder | Time | Notes |
|---|---|---|
| libx264 `--preset veryfast` | 1.8s | Baseline. Production-ready for live. |
| libx264 `--preset slow` | 6.4s | The default for offline. Better quality. |
| x265 `--preset slow` | 19.7s | Roughly 3× x264 at similar preset. |
| SVT-AV1 `--preset 8` | 4.2s | Fast preset. Quality below libaom. |
| SVT-AV1 `--preset 6` | 12.1s | Production default. |
| libaom `--cpu-used 4` | 38.6s | Best AV1 quality. Almost never worth the time. |
| WebCodecs HW (M3, AV1) | 1.9s | Hardware-accelerated. Apple Silicon only. |

The shape here is what you would expect: H.264 is fast, H.265 is slow, AV1 ranges from "slow" (SVT-AV1) to "absurd" (libaom). Hardware encoders close the gap, but only on the platforms that have them. CI runners almost never have GPU encoders.

If you are encoding once and serving many, AV1 with SVT-AV1 preset 6 is fine. If you are encoding per-request (personalized video, dynamic generation), AV1 is a non-starter without hardware. We covered the WebCodecs hardware path in detail in [WebCodecs for deterministic video](/blog/webcodecs-deterministic-video-2026).

## The licensing footnote nobody likes

H.264 is patent-encumbered, but the patents have a clean licensing path through MPEG-LA. In practice, every browser, every OS, every chip ships with it. You pay nothing as a developer.

H.265 is patent-encumbered through *multiple* incompatible pools (MPEG-LA, HEVC Advance, Velos Media). The legal exposure for shipping H.265 in a product is real and ambiguous.
This is why H.265 was slow to land in browsers and why several large players actively avoid it.

AV1 is royalty-free. AOMedia governs it; the founding members (Google, Mozilla, Microsoft, Cisco, Netflix, Apple, Amazon, Meta) have committed to not pursuing patent claims. This is the *single biggest reason* AV1 has displaced H.265 as the heir apparent: not the compression gains, the licensing.

If you are picking a codec for a new product in 2026 and you have to pick one, the legal calculus favors AV1 over H.265 by a wide margin.

## CDN compatibility

A practical thing that derailed several of our shipped projects: not every CDN can serve every codec equally well. Specifically, HLS and DASH support varies.

- **H.264 in MP4 or fMP4 (HLS)**: works on every CDN, every player. The known-good path.
- **H.265 in fMP4 (HLS)**: works on Apple platforms; works on most modern players; needs explicit configuration on some CDNs. Cloudflare Stream supports it; older Akamai configurations may not.
- **AV1 in fMP4 (HLS)**: works in HLS since version 9. Player support is good in 2026 but you should test.
- **AV1 in WebM (DASH)**: the cleanest AV1 delivery path. Universal browser support.

Our deployment recommendation: ship an HLS ladder with H.264 (for compatibility) + AV1 (for efficiency on capable clients), and let the player negotiate. Skip H.265 unless you have a specific reason.

## A flowchart, because someone always asks

Here is the simplified decision tree I sketch on whiteboards.

```
Is this content delivered to many viewers, encoded once?
├─ Yes → Ship AV1 + H.264 fallback. Done.
└─ No (per-request render)
   ├─ Does the encoding machine have a GPU with AV1 HW encode?
   │  ├─ Yes → AV1.
   │  └─ No → H.264.
   └─ Is bandwidth the constraint?
      ├─ Yes → Suffer through software AV1.
      └─ No → H.264 and move on.
```

Notably absent: H.265 appears nowhere. In 2026, I cannot find a green-field reason to pick it. If you already have an H.265 pipeline shipping, keep it.
If you are starting fresh, do not. ## What HyperFrames ships, and why For the curious: the [HyperFrames CLI](/docs) defaults to H.264 (`avc1.640028`, High@4.0) because it is universally playable. The `--codec av1` flag emits AV1, and on machines with hardware encode (Apple Silicon M3+, modern Nvidia/AMD with the right drivers), it is roughly the same wall-clock as H.264. The CLI will warn you if you ask for AV1 on a machine without hardware encode and the resulting render would be more than 10× the duration of the input. Our internal CI runs the full test suite against both codecs and verifies VMAF parity. The codec choice has not affected determinism since we moved to deterministic bitrate budgeting in late 2025 (the `--target-bitrate` and `--qp` flags). ## What to watch in the next 18 months A few things to keep an eye on, if you are operating at the codec layer. - **AV2** is in active development at AOM. Public encoders exist but are research-grade. Probably not shippable for another 24+ months. - **VVC (H.266)** decoder support has not landed in any browser. It is technically excellent and politically dead. - **JPEG XS** is becoming relevant for low-latency professional use (broadcast contribution, live editing). Not relevant for web playback. The story for the web is, for once, settling down. We have a fast, free, efficient codec (AV1), and a fallback (H.264) that works everywhere. The interesting work in 2026 is not picking codecs — it is everything *around* them: deterministic encoding, frame-accurate timing (see our [2026 status report on timing](/blog/frame-accurate-timing-browser-2026)), and the WebCodecs API maturing into something you can actually build on. That is, refreshingly, a kind of boring future. Good. 
--- # The three T's of editorial motion URL: https://hyperframes.video/blog/three-ts-of-editorial-motion Published: 2026-05-14T13:00:00.000Z Tags: design, typography, editorial, craft Author: marcus-okafor There is a category of motion design that I think of, slightly pretentiously, as *editorial*. It is what the New York Times graphics desk does. It is what The Pudding has built a career on. It is the unhurried, type-driven, fact-first style of motion that exists to communicate, not to impress. When it is done well, you do not notice the design at all. You notice the story. I have spent ten years trying to do this kind of work. I have failed at it more times than I can count — produced graphics that were technically correct, on-brand, well-eased, and somehow not editorial. The mistake has always been the same: I optimized for the *look* of the motion when I should have optimized for the *reading* of it. What separates editorial motion from decoration are three constraints that have to hold simultaneously. Type. Timing. Tonality. I want to walk through each of them, with examples from the work that taught me, and end with the composition I cannot show you (because it has not shipped) that finally clicked everything into place. ## Type: the thing that does the work In editorial motion, typography does 80% of the work and motion does 20%. This is the inversion that ad-style motion designers struggle with most. We come from a culture where motion is the show. In editorial, motion is the punctuation. This has a practical consequence: the type has to be *real* type. Not display type cribbed from a font pairing tutorial. Real, considered, professionally chosen type with optical sizes that adapt to scale, italic that means something, ligatures and small caps that show up where they should. The motion is supporting that type, not carrying the composition. My defaults for editorial work: - Headlines in a contemporary serif with strong italics. 
Newsreader, Source Serif Pro, Tiempos, Reckless. The italic is non-negotiable; it is how I emphasize the part of the headline that carries the argument. - Captions in a humanist sans or monospace. Inter, IBM Plex Sans, JetBrains Mono. Letter-spaced wide for category labels; standard for body. - One typeface family per composition. Never more than two. Pairing fonts is a way to look like you tried; using one family well is a way to look like you know. The other half of type in motion is *measure* — the width of the text column. CSS calls this `max-width`, and editorial designers obsess over it. A line that is too wide is fatiguing; a line that is too narrow is choppy. The traditional rule is 50-75 characters per line for body text, 25-45 for headlines. I have these numbers as CSS variables in my templates so I do not have to think about them. The third half (yes, there are three halves) is *hierarchy*. In a 15-second composition there might be three pieces of type: a category label, a headline, a caption. They should have three different sizes, three different weights, and three different roles. Confused viewers cannot read editorial motion; clear hierarchy is what makes them read. ## Timing: the rhythm of an argument Timing is the part of editorial motion that I find hardest to teach, because it is felt rather than measured. But there are heuristics, and the heuristics are what get newcomers in the door. The first heuristic: *reading time is the budget*. If a piece of text takes 2.5 seconds to read aloud at a comfortable pace, it needs to be on screen for at least 2.5 seconds — preferably 3.5 to give the viewer a beat after they finish. Most amateur editorial motion fails this. Text appears, you start reading, the text is replaced before you finish. The composition was timed to look good in the timeline, not to be read. The second: *animations should not compete with reading*. 
If a headline is on screen and you want the viewer to read it, do not animate other things at the same time. The eye cannot do both. Bring the headline in, then *stop*. Hold. Let the viewer read. Then animate the next element in.

The third: *the gap between beats matters as much as the beats*. The "hold" between two pieces of information is where the viewer's brain does the work of connecting them. Skip the hold and the viewer is reacting to whatever just appeared; honor the hold and the viewer is *thinking*.

A typical editorial beat structure I use for a 15-second composition:

1. **0-0.8s**: Category label fades in. Small, monospaced, top of frame.
2. **0.8-1.6s**: Headline enters with a settle.
3. **1.6-5.0s**: Hold. Nothing moves. The viewer reads.
4. **5.0-6.2s**: Supporting graphic (chart, image) enters.
5. **6.2-9.5s**: Hold. The viewer reads the graphic against the headline.
6. **9.5-11.0s**: Caption or attribution fades in.
7. **11.0-14.0s**: Hold. The composition completes.
8. **14.0-15.0s**: Exit.

Nearly two-thirds of the composition is hold (9.7 of the 15 seconds). New designers find this terrifying — "but nothing is happening!" — and that is exactly the point. The motion is the punctuation; the holds are the sentences.

## Tonality: the voice of the composition

Tonality is the third T and the one that takes the longest to develop. It is the answer to the question "what does this composition sound like, if it had a voice?" The same content, animated with the same timing, can sound earnest or sardonic or breathless or quiet, depending on the choices that surround it.

The choices that build tonality are mostly small.

**Color saturation.** Desaturated palettes feel adult, considered, archival. Saturated palettes feel young, urgent, consumer. I tend toward 60-70% saturation for editorial work; full saturation reads as ad-style.

**Motion amplitude.** How far things travel during their animations. A title that rises 12 pixels feels quiet; the same title rising 80 pixels feels emphatic.
Editorial motion is usually 8-24 pixels of travel; ad-style motion is 40-200. **Easing personality.** I wrote about this in [easing that looks like money](/blog/easing-that-looks-like-money). Editorial work uses the settle and the professional in-out; it rarely uses the bounce or the anticipate-and-strike. The drama in editorial comes from the content, not from the curves. **Visual density.** Editorial frames have more empty space than ad frames. The viewer's eye should land in one place and stay there. A frame with five animated elements is a frame that is shouting; a frame with one animated element and four still ones is a frame that is talking. ## A real example, one I cannot show Last month I shipped a 22-second composition for a long-form journalism piece. The brief was: present a single statistic ("4.7 million") with a small contextual sentence, attribute it to a study, and end with a sign-off. The piece had to feel like a print magazine, not like a TikTok. The composition I shipped was almost embarrassingly simple. Cream background. One serif headline with the number set in italic. One paragraph of supporting text in a humanist sans, set to 65 characters per line, hanging below the number. One small monospace attribution in the corner. Three beats — number arrives, text arrives, attribution arrives — separated by long holds. The whole thing breathed. The editor I worked with on it said something I have been thinking about since: "It feels like a paragraph, not a video." That was the highest compliment I could have received. The motion had become invisible. What was left was the writing. This is the trick of editorial motion. The motion is in service to the writing. When you achieve this, the work stops being "motion graphics" in the genre sense and starts being something closer to *typography that moves*. Which is, I think, the better name for it. ## What changes when you accept the three T's A few things start to shift. You stop reaching for libraries. 
The plugins, the asset packs, the After Effects templates — they all add visual complexity. (For the longer take on why editorial motion fits poorly into a timeline tool at all, see the [After Effects comparison](/compare/after-effects).) Editorial work usually wants less complexity, not more. Your tool of choice becomes a text editor and a font. You stop showing your work. Junior motion designers want every animation to be *noticed*. Senior editorial designers want animations to be *felt without being noticed*. The viewer should leave the composition remembering the content, not the design. You start having opinions about every number. Why is the headline 96px? Because at this distance and at this measure, that is the size where the eye can read it without effort. Why is the hold 3.2 seconds? Because that is how long it takes to read the headline plus one beat. Every number has a reason. Editorial motion is a designed object, not an arranged one. You start to see editorial motion everywhere it exists, and to recognize its absence. The graphics packages on serious newscasts. The intros to documentaries. The credit sequences in literary films. The data visualizations in academic press. There is a small global community of designers doing this work, and once you start looking for it, you cannot stop. ## How to begin If you are convinced and want to start, the on-ramp is short. Pick a print magazine you admire. Read one feature. Notice how the type works on the page — the headline, the lede, the pull quotes, the captions. Now open a text editor and recreate one of those pages in HTML and CSS. Static. No motion. Just the type, right. Then, and only then, add one moving element — the [HyperFrames playground](/playground) is a good place to do this without setting up a project. Maybe the headline rises into place over 800ms with a cubic-bezier settle. Maybe nothing moves and the whole composition is a still that you render for fifteen seconds with a slow backdrop drift. 
The point of this exercise is to feel how little motion the composition needs to feel alive. When you are confident in one element of motion, add another. Then another. By the third element, you will have rules — your own rules — about what motion is allowed and what is not, in this composition's tonality. Those rules are the beginning of your editorial vocabulary. This is the work. It is slower than reaching for a preset, less impressive on a portfolio reel, and the most durable thing a motion designer can develop. Type. Timing. Tonality. The rest is variation. --- # The AI video landscape in 2026: Sora 2, Veo 3, and the gap deterministic rendering fills URL: https://hyperframes.video/blog/ai-video-landscape-2026 Published: 2026-05-14T13:00:00.000Z Tags: ai, video, strategy, research Author: hf-team The question we get most often, in 2026, goes something like: "If Sora 2 can produce sixty seconds of 4K video from a sentence, what is the point of writing HTML?" It is a fair question, asked in good faith, and we owe it a real answer rather than a defensive one. This post is that answer. The short version: generative video and deterministic rendering are not competitors. They are complementary primitives, and the interesting agentic systems of 2026 use both. The long version is below. ## The 2026 generative video field A snapshot, as of this week. Numbers are public pricing and roughly current capability; everything moves fast, so treat these as a directional snapshot rather than a benchmark. - **Sora 2 (OpenAI)**: 60s max, 4K, ~$0.40/second for the highest-quality tier. Released February 2026. The headline change from Sora 1 is consistent character identity across cuts and meaningful improvement on hands and text. Still struggles with rigid geometry — UI, charts, anything with hard edges and predictable motion. - **Veo 3 (Google)**: 60s, up to 4K, ~$0.30/second. Released March 2026. Strongest physics simulation of the lot. Best-in-class for liquids, smoke, fabric. 
Worse than Sora 2 at character consistency. - **Runway Gen-4**: 30s max, 1080p, ~$0.20/second. Released late 2025. Strongest editorial controls — reference images for style, camera trajectory inputs, motion brushes. The pro tool of choice for many real productions. - **Pika 3.0**: 15s, 1080p, ~$0.08/second. The fast, cheap option. Quality is noticeably below the leaders but the latency is good enough for ideation. - **Open weights (Hunyuan-Video 2, Mochi-2)**: Self-hosted, ~$0.05/second amortized on a single H100. Quality roughly equivalent to Pika 3.0; the value is control and privacy. What none of these will do, in 2026, is render the exact text you ask for at the exact pixel coordinates you ask for, the same way, twice in a row. That is not a flaw. It is the entire shape of how generative video works. The pixels are sampled, not specified. ## The deterministic workload, and why agents need it Deterministic rendering — the HyperFrames primitive — solves a different problem. You write HTML. You get an MP4. Same HTML, same MP4, byte-identical, every time. The reasons this matters: - **Snapshot tests.** If your video changes when the input changes and is identical when the input is identical, you can write `assertVideoUnchanged(prev, next)` and have it mean something. With Sora, you cannot. - **Diff-able output.** Agents iterate by comparing the result of action *n* to action *n+1*. If the underlying renderer adds noise, the comparison is unreliable. - **Pixel-exact text.** A chart with the label "$47,832.12" needs to render exactly that string, in the corporate brand font, at the corporate brand position. Generative models will produce "$47,832.12" sometimes and "$47.832,12" other times. - **Sub-second iteration.** A render of a 5-second composition takes 1-2 seconds. A Sora generation takes 30-90s. For an agent looping on visual feedback, the difference is the whole loop. 
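
The snapshot-test bullet above can be sketched in a few lines. This is a minimal illustration, not HyperFrames' actual implementation: `assertVideoUnchanged` and `digest` are hypothetical helpers that take the raw bytes of two renders and compare SHA-256 digests. The comparison is only meaningful under the assumption stated above, that identical input produces byte-identical output.

```typescript
import { createHash } from "node:crypto";

// Hash the raw bytes of a rendered file; byte-identical renders give identical digests.
function digest(bytes: Uint8Array): string {
  return createHash("sha256").update(bytes).digest("hex");
}

// Hypothetical helper: throws if the two renders differ by even a single byte.
function assertVideoUnchanged(prev: Uint8Array, next: Uint8Array): void {
  const a = digest(prev);
  const b = digest(next);
  if (a !== b) {
    throw new Error("render changed: " + a.slice(0, 12) + " != " + b.slice(0, 12));
  }
}
```

With a sampled (generative) renderer the same check would fail on nearly every run, which is why a test like this is only useful on the deterministic side of a pipeline.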
The deeper version of this is in our post on [why determinism is the unlock](/blog/deterministic-video-manifesto). The short version: agents are loops, loops need feedback, feedback needs to be comparable, comparability needs determinism. ## A taxonomy of video workloads Here is the mental model we use internally when teams ask "which tool for which job." We carve video workloads along two axes: how much imagination is needed, and how much precision is needed. Each quadrant has a clear winner. - **High imagination, low precision** (an establishing shot of a city at dawn): Sora 2 or Veo 3. There is no other reasonable answer. - **Low imagination, high precision** (a chart with quarterly revenue, a product release video, a UI demo): HyperFrames or another deterministic renderer. Generative models cannot get the numbers right. - **High imagination, high precision** (a character delivering exact dialogue with exact branded backdrops): a hybrid pipeline. Generate the character with Sora; composite a deterministic lower-third with HyperFrames; mux them together. - **Low imagination, low precision** (a stock B-roll cut of waves on a beach): stock footage. Honestly. Neither tool is the right answer. The interesting work, in 2026, is mostly in the high-imagination-high-precision quadrant. That quadrant did not exist in a usable form until both halves matured. ## What hybrid pipelines actually look like Three patterns we see in production, with real customers: **Pattern 1: Generative B-roll, deterministic chrome.** A marketing team generates a 30-second sequence of conceptual footage with Veo 3, then composites a deterministic title sequence, lower-thirds, and end card from HyperFrames on top. The deterministic layer is what the brand reviews; the generative layer is what the brand vibes on. **Pattern 2: Personalized data video.** An ops team generates 50,000 personalized year-in-review videos. 
Each one has the user's name, their actual usage data, their actual top three categories. None of that can come from a generative model — it has to be pixel-exact. But the *backdrop* — the abstract animated scene behind the data — is generated once with Sora and reused. Cost: under a cent per personalized video. **Pattern 3: Agent + reviewer loop.** An agent generates HTML for a chart, renders it deterministically with HyperFrames, evaluates the output (does the chart make the point?), iterates. Once the deterministic part looks right, a second pass uses a generative model to fill the "hero" part of the composition. The agent never tries to control the generative output frame-by-frame, because it can't. ## Cost math, for the spreadsheet-curious A back-of-the-envelope for a 10-second 1080p personalized video, generated 100,000 times: - Sora 2 only: $0.40/sec × 10s × 100,000 = $400,000. (Also: nondeterministic, so 100,000 reviews.) - HyperFrames only: roughly $0.0015 per render at scale = $150. (Deterministic, reviewable as a single template.) - Hybrid (Sora backdrop generated once, deterministic composite per render): $0.40 × 10 × 1 (backdrop) + $0.0015 × 100,000 = $154. The economics are not subtle. Generative video is priced per call because each call genuinely costs the provider real GPU-seconds. Deterministic video is priced per render because each render is mostly browser-time, which is cheap and parallelizable. For one-of-a-kind hero content, the generative cost is fine. For personalized-at-scale content, deterministic is the only economically possible option. ## What changed in the last 18 months Three things, in our reading. First, generative models got good enough that *we* started using them for things HyperFrames cannot do. The agent we use internally to draft blog post header animations now generates a Veo 3 backdrop and composites HyperFrames text on top. We did not predict that workflow two years ago. 
Second, the price of generative video dropped roughly 4× in 18 months. This makes hybrid economically possible for use cases where it was not before. Third, agents got good enough at writing HTML that the deterministic side stopped being the bottleneck. In 2023 the bottleneck was "can an LLM write a passable HTML animation?" In 2026 the answer is "yes, on the first try, given a clear spec." That changed the calculus on what to build deterministically vs generatively. We wrote more about the agent side of this in [why AI agents need deterministic rendering](/blog/ai-agents-need-deterministic-rendering). ## What we do not do, and will not do People sometimes ask if HyperFrames will integrate generative video directly — "render this HTML, then have Sora fill in this region." We will not, for now. Not because it is uninteresting, but because the right primitive for that workflow is *composition*, not bundling. You should be able to plug whatever generative model you want into the pipeline — Sora today, whatever-comes-next tomorrow — without us building the integration. The CLI exposes `--background-video` for exactly this purpose; pipe in whatever you generated, get a composited MP4 out. This is also why our docs lean on showing the layering pattern rather than promoting any specific provider. The interesting unit is not "video generator + video renderer" — it is "video pipeline" with replaceable parts. ## The 2026 question, answered If Sora 2 can produce sixty seconds of 4K video from a sentence, what is the point of writing HTML? The point is that some video is not "from a sentence" — it is from a database row, a brand kit, a customer profile, an agent's plan, a spreadsheet, an A/B test. That video needs to look like the data, not like a guess at the data. Generative models will keep getting better at imagination. They will not get better at obedience to precise structure, because that is not what they are. The right question for 2026 is not "Sora *or* HyperFrames." 
It is: "for *this* video, what part wants to be imagined, and what part wants to be specified?" If you can answer that honestly, the rest of the architecture writes itself. If you want to play with the deterministic side, the [playground](/playground) is the fastest path in. If you want to wire it into an agent stack, the [developer docs](/developers) cover the API. We will keep watching the generative side and writing about it when it changes — which, at the current pace, is roughly every six weeks. --- # CSS gradient animation that doesn't look like 2014 URL: https://hyperframes.video/blog/css-gradient-animation-tutorial Published: 2026-05-14T09:00:00.000Z Tags: css, gradient, tutorial, motion Author: marcus-okafor The animated CSS gradient had a bad decade. Every SaaS landing page from 2017 to 2022 used the same shifting-purple-to-pink loop, and the technique got tarred with the trend. But the underlying primitive — interpolating colors across a surface, smoothly, over time — is genuinely useful, especially in video. Used carefully, an animated gradient can do the work of a moving graphic without any moving parts. Here are five gradient techniques that still hold up, what they're good for, and how to make them not look generic. ## 1. The slow hue rotation (background) The simplest version: a linear-gradient between two near-complementary colors, with `background-size` larger than the viewport, and a `background-position` animation that slides over 20–40 seconds. The slowness is the point — anything under 10s reads as "loading." 
<InlineSandbox html={`<!doctype html> <html><body style="margin:0;height:100vh;overflow:hidden;"> <div style="position:absolute;inset:0;background:linear-gradient(135deg,#ff3b1f,#7b2cff 45%,#1f5fff 90%);background-size:300% 300%;animation:slide 24s ease-in-out infinite alternate;"> </div> <div style="position:relative;height:100%;display:grid;place-items:center;color:#fff;font:600 64px ui-sans-serif,system-ui;letter-spacing:-.02em;mix-blend-mode:plus-lighter;">Brand · 2026</div> <style>@keyframes slide { 0%{background-position:0% 0%;} 100%{background-position:100% 100%;} }</style> </body></html>`} height={320} caption="Hero background — 24-second loop, no JavaScript." /> Two rules: use **three colors, not two** (linear-gradients between two colors look like a paint chip), and keep the angle off-axis (`135deg` not `90deg`). Both moves get you out of the 2017 aesthetic. ## 2. The mesh gradient (multiple radial overlays) A "mesh gradient" — Stripe-style, smooth, painterly — is faked in CSS with three or four `radial-gradient` layers, each positioned in a different corner, all composited together. Animate each layer's position separately and the result moves like a painter's wet canvas. 
<VariableKnobs
  html={`<style>
:root { --c1: {{$C1}}; --c2: {{$C2}}; --c3: {{$C3}}; --c4: {{$C4}}; }
body{margin:0;height:340px;position:relative;overflow:hidden;background:#0a0a0a;}
.mesh{position:absolute;inset:0;
  background:
    radial-gradient(at 20% 30%, var(--c1) 0%, transparent 50%),
    radial-gradient(at 80% 20%, var(--c2) 0%, transparent 50%),
    radial-gradient(at 70% 80%, var(--c3) 0%, transparent 50%),
    radial-gradient(at 20% 80%, var(--c4) 0%, transparent 50%),
    #0a0a0a;
  filter:blur(40px);
  animation:m 16s ease-in-out infinite alternate;}
@keyframes m { 0%{transform:scale(1) translate(0,0);} 100%{transform:scale(1.15) translate(-4%,3%);} }
.lbl{position:absolute;inset:auto 24px 24px 24px;color:#fff;font:600 36px ui-sans-serif,system-ui;letter-spacing:-.02em;}
</style>
<div class="mesh"></div>
<div class="lbl">{{$LABEL}}</div>`}
  knobs={[
    { name: "LABEL", label: "Label", default: "Series A — closed" },
    { name: "C1", label: "Corner 1", type: "color", default: "#ff3b1f" },
    { name: "C2", label: "Corner 2", type: "color", default: "#ff9b00" },
    { name: "C3", label: "Corner 3", type: "color", default: "#2b66ff" },
    { name: "C4", label: "Corner 4", type: "color", default: "#7b2cff" }
  ]}
/>

The `filter: blur(40px)` is doing most of the work — it smears the four radial overlays into a continuous field. Without the blur, you can see the discrete radials.

## 3. The conic sweep

`conic-gradient` rotates through colors around a center point. Register an angle-typed custom property (`--r` in the snippet below), animate it, and the colors orbit. This is the trick behind every "live border" gradient you've seen on a Twitter follow button.

```css
/* Registering --r as an <angle> is what lets the browser interpolate it. */
@property --r {
  syntax: '<angle>';
  initial-value: 0deg;
  inherits: false;
}

.glow {
  background: conic-gradient(from var(--r), #ff3b1f, #7b2cff, #1f5fff, #ff3b1f);
  animation: spin 4s linear infinite;
}

@keyframes spin {
  to { --r: 360deg; }
}
```

Use it as a `mask` or pair it with a thicker outline on a dark element — a glowing border that reads as expensive.

## 4. The animated noise layer (the secret one)

This is the move that separates "designed gradient" from "CSS demo gradient." Add a low-opacity SVG noise overlay on top of your gradient. The grain breaks the banding artifacts that plague large gradient regions and adds a film-like texture.

```html
<svg style="position:absolute;inset:0;opacity:.08;mix-blend-mode:overlay;">
  <filter id="n">
    <feTurbulence type="fractalNoise" baseFrequency=".9" numOctaves="2"/>
  </filter>
  <rect width="100%" height="100%" filter="url(#n)"/>
</svg>
```

Layer that on every gradient. It costs nothing and elevates everything.

## 5. The "lava lamp" — blurred shapes behind glass

Two or three circles in primary brand colors, large (300px+), blurred heavily (60–100px), animated to drift on independent slow loops. Put a `backdrop-filter: blur(40px)` glass layer in front. Result: a Stripe-payment-form aesthetic without any of the JS.

## The video gotcha: banding

The single biggest reason "my gradient looked great in the browser but terrible in the MP4" is banding. Video codecs aggressively quantize smooth color regions, and a 24-bit gradient becomes a 6-band mess at typical H.264 bitrates. Three fixes, in order:

1. **Add noise.** (See above.) Noise breaks the bands by adding entropy the codec can't quantize away.
2. **Use lower contrast.** A gradient from `#ff3b1f` to `#1a0010` has 70% less banding than `#ff3b1f` to `#000000`.
3. **Render at higher bitrate.** [HyperFrames](/tools/html-to-video) defaults to CRF 18 (lower CRF means higher quality), which is comfortably on the safe side of where banding kicks in.

## Putting it together

A landing-page hero usually wants #1 + #4. A social media still wants #2 + #4. A video intro wants #2 + #3 + #4. The thing tying them together is #4 — always add noise.

[Open the playground](/playground), pick a gradient, render the export. Twenty seconds of motion, zero JavaScript, zero framerate concerns.
---

# Generate 1,000 personalized videos from a CSV

URL: https://hyperframes.video/blog/batch-personalized-videos-from-csv
Published: 2026-05-14T09:00:00.000Z
Tags: batch, csv, personalization, developers
Author: kira-tanaka

The most common request from marketing teams who learn about deterministic video rendering: "Can we send a personalized MP4 to every name on our list?" The answer, with a CSV and a template, is yes — and the cost is roughly $4 per thousand at our infrastructure prices, not the $4 per *video* you would pay a freelancer.

Here is the pattern that makes it work in CI, without spinning up a custom service.

## The shape of the data

A flat CSV. One row per recipient. Columns become template variables.

```csv
name,company,metric,delta
Jordan,Acme,MRR,24.6
Priya,Globex,signups,38.1
Marcus,Initech,activations,17.2
```

Three rules:

1. Keep it flat. If you need nested data, denormalize.
2. UTF-8 with no BOM. Most CSV tools do the right thing; some do not.
3. Header row is canonical. The template references `{{$NAME}}`, not column index 0.

## The template

A single HTML file with `{{$VAR}}` placeholders. We covered the [personalized card pattern](/blog/render-video-from-nextjs-route) for the on-demand case; the batch case uses the exact same template.

The template is the variable surface. If the brand changes the layout, you change it here. If the brand changes the recipient list, you change the CSV. Neither change affects the other.
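
To make the header-row rule concrete, here is a minimal sketch of a generic substitution step. It assumes a mapping that the text implies but does not spell out: each CSV column name, upper-cased, is the placeholder name, so `name` fills `{{$NAME}}`. The helper names `fillTemplate` and `escapeHtml` are hypothetical. Values are HTML-escaped, and a placeholder with no matching column throws, so a typo in the header fails the build instead of shipping a video that says `{{$NAME}}`.

```typescript
type Row = Record<string, string>;

// Escape characters that would otherwise be parsed as markup.
function escapeHtml(value: string): string {
  return value
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;");
}

// Replace every {{$VAR}} with the matching column value.
// Assumption: column "name" maps to placeholder {{$NAME}} (upper-cased header).
function fillTemplate(template: string, row: Row): string {
  const byKey = new Map<string, string>();
  for (const [column, value] of Object.entries(row)) {
    byKey.set(column.toUpperCase(), value);
  }
  return template.replace(/\{\{\$([A-Z0-9_]+)\}\}/g, (_match: string, key: string) => {
    const value = byKey.get(key);
    if (value === undefined) {
      throw new Error("template references {{$" + key + "}} but the CSV has no such column");
    }
    return escapeHtml(value);
  });
}
```

The replacement is passed as a function rather than a string on purpose: a string replacement gives sequences such as `$&` special meaning, while a function's return value is inserted literally, which matters once real recipient data contains a dollar sign.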
## The renderer loop

The skeleton, in any language:

```ts
import fs from 'node:fs';
import { parse } from 'csv-parse/sync'; // assumed parser; any CSV reader with a header-row mode works

// `renderHtmlToMp4` is assumed to be provided by your renderer.
const template = fs.readFileSync('template.html', 'utf8'); // illustrative path
const csv = fs.readFileSync('recipients.csv', 'utf8');
const rows = parse(csv, { columns: true }); // one object per row, keyed by header

await Promise.all(rows.map(async (row) => {
  // Function replacers insert values literally, so a "$" in the data is safe.
  const html = template
    .replace(/\{\{\$NAME\}\}/g, () => row.name)
    .replace(/\{\{\$COMPANY\}\}/g, () => row.company)
    .replace(/\{\{\$METRIC\}\}/g, () => row.metric)
    .replace(/\{\{\$DELTA\}\}/g, () => row.delta);
  const mp4 = await renderHtmlToMp4(html, { width: 1920, height: 1080, duration: 6 });
  await fs.promises.writeFile(`out/${row.name}.mp4`, mp4);
}));
```

Two notes:

1. **Replace, not template-engine.** Avoid Handlebars / Mustache here; their parsers do too much. A regex replace is faster, simpler, and predictable.
2. **`Promise.all` is too aggressive.** For 1,000 rows, you want a concurrency limit (8-32 depending on your worker size). Use `p-limit` or the equivalent in your runtime.

## The concurrency bound

A single Chromium instance can drive one render at a time. The bottleneck is how many Chromium processes fit in memory, not CPU: each render uses ~400MB of RAM. The math on a typical CI worker:

For larger batches, parallelize across [GitHub Actions matrix](/integrations/github-actions) jobs. Each runner gets a slice of the CSV; outputs accumulate in S3.

## The variable preview

What changes per row.
Drag the knobs to see the per-recipient variation: <VariableKnobs html={`<style>body{margin:0;background:#f6f5f1;height:100vh;display:grid;place-items:center;font-family:ui-sans-serif,system-ui;} .c{padding:64px;background:white;border-radius:20px;box-shadow:0 20px 60px rgba(0,0,0,.1);min-width:520px;} .h{font-size:18px;color:#6b6862;letter-spacing:.2em;text-transform:uppercase;} .n{font-size:72px;font-weight:800;margin:12px 0 24px;} .m{font-size:24px;color:#6b6862;} .d{font-size:64px;font-weight:800;color:#ff3b1f;font-variant-numeric:tabular-nums;}</style> <div class="c"><div class="h">For {{$COMPANY}}</div><div class="n">Hi {{$NAME}}.</div><div class="m">Your {{$METRIC}} grew</div><div class="d">+{{$DELTA}}%</div></div>`} knobs={[ { name: "NAME", label: "Name", default: "Jordan" }, { name: "COMPANY", label: "Company", default: "Acme" }, { name: "METRIC", label: "Metric", default: "MRR" }, { name: "DELTA", label: "Delta", default: "24.6" } ]} caption="The same template — change any knob, the MP4 changes." /> ## Cost per render Real numbers from our infrastructure for a 6-second, 1080p, 30fps render: - Compute: ~$0.003 per render (GitHub Actions Large runner, amortized) - Storage: ~$0.0005 per render (S3 standard, 1 month retention) - Egress: ~$0.0001 per delivered view A 1,000-recipient campaign costs about $4 to produce and store. The freelancer rate for personalized video is $1-5 per *output*, so the pattern is roughly three orders of magnitude cheaper. See [our 10K variants overnight piece](/blog/render-10k-variants-overnight) for the full breakdown. ## Distribution Once you have 1,000 MP4s in S3, the delivery options: - **Email.** A signed `<video>` URL or an animated GIF preview that links to the MP4. Most clients block autoplay; use a poster image. - **Personalized landing pages.** A URL per recipient (`/v/{token}`) that loads their MP4. Easy to instrument click-through. - **Direct DMs.** Slack/LinkedIn message with the video attached. Pick one. 
Most teams overcomplicate this; a single delivery surface is usually right.

## What this enables

The line between "marketing video" and "marketing email" used to be that emails were personalized and videos were not. Once videos are also a function call over a CSV, the line goes away. Every email can have a 6-second MP4 in it that *says the recipient's name*. The novelty wears off in a few quarters; the engagement lift does not.

If you want to wire this into your CI today, the [GitHub Actions integration](/integrations/github-actions) has a copy-paste matrix workflow. Pair with the [marketing use case](/use-cases/marketing) for examples in the wild.

The CSV is the spec; the template is the contract; the MP4s are the build artifacts. Same pattern that made deterministic builds eat the software industry. It is eating video next.

---

# Animated KPI cards that look like money

URL: https://hyperframes.video/blog/animated-kpi-stat-cards
Published: 2026-05-13T15:00:00.000Z
Tags: kpi, data-viz, css, tutorial
Author: ren-park

A KPI card is one number, one label, and one trend indicator. It is the most overrepresented widget in modern dashboards and the most overpaid-for asset in revenue marketing. The good version costs zero dollars and renders in 40 lines.

Here is what makes a KPI card feel like a *result* instead of a *cell*.

## The four parts

1. **The big number.** Tabular numerals, heavy weight, oversized relative to the card.
2. **The label.** Small, tracked-out, all caps, quiet.
3. **The delta.** Up or down, with an arrow, in the brand-positive or brand-negative color.
4. **The sparkline.** Tiny, abstract, supporting.

Get those four right and the card carries weight. Get them wrong and it looks like a Bootstrap demo.

## The count-up

The single feature that separates "data widget" from "story." The number animates from 0 to its final value as the card enters.
The easing must be the [settle curve](/blog/easing-that-looks-like-money) — anything linear feels mechanical, anything bouncy feels childish.

```javascript
function countUp(target, duration, el) {
  const start = performance.now();
  function tick(now) {
    // Progress through the animation, clamped to [0, 1].
    const t = Math.min(1, (now - start) / duration);
    // Cubic ease-out: fast start, gentle settle.
    const ease = 1 - Math.pow(1 - t, 3);
    el.textContent = (target * ease).toFixed(1);
    if (t < 1) requestAnimationFrame(tick);
  }
  requestAnimationFrame(tick);
}
```

If this lives in a [deterministic render pipeline](/blog/deterministic-video-rendering-ci), rewrite it as `render(t)` and the renderer will drive the time. The animation runs at 60fps in the browser; the renderer captures frames at the requested intervals.

## The typography

Two non-negotiable CSS declarations:

```css
.number {
  font-variant-numeric: tabular-nums;
  font-feature-settings: 'tnum';
}
```

Without these, the digits dance during the count-up — in a proportional font `1` is narrower than `7`, the layout reflows, the eye reads it as broken. With them, every digit has identical width, and the count-up reads as a single block of numbers.

The font weight matters too. KPI numbers want 800 or 900, not 600. The weight communicates *importance*. A semi-bold KPI looks tentative.

## The delta indicator

A small chip above or beside the number: `↑ 24.6%`. Two rules:

1. **Color is signal.** Green for up-and-good, red for down-and-bad, but watch your context — a "bounce rate is down" delta should be green, not red. The semantic mapping is "is this the desired direction," not "is this an increase."
2. **The arrow is part of the type.** Use the Unicode glyphs (`↑ ↓`) and the same font as the number. Bolted-on SVG arrows fight the type.

The delta animates in *after* the number has settled. 200ms gap; the eye lands on the number, then reads the context. Reverse this and the eye splits.

## The sparkline

Tiny chart inside the card, typically 60-80px wide and 24-32px tall.
It does not need to be legible at a data-point level; it needs to communicate *trend*. An SVG `<polyline>` with a handful of points works fine:

```html
<svg viewBox="0 0 100 30">
  <polyline fill="none" stroke="currentColor" stroke-width="2"
            points="0,25 25,18 50,22 75,8 100,5" />
</svg>
```

For real data, downsample to ~10 points; more is visual noise at that scale.

## The variable preview

The four knobs that turn the card into your brand:

<VariableKnobs
  html={`<style>body{margin:0;background:#f6f5f1;font-family:ui-sans-serif,system-ui;height:100vh;display:grid;place-items:center;}
.c{background:white;border-radius:18px;padding:36px 44px;border:1px solid #e3dfd3;box-shadow:0 20px 60px rgba(0,0,0,.08);min-width:340px;}
.l{font-size:11px;letter-spacing:.3em;text-transform:uppercase;color:#6b6862;}
.n{font-size:84px;font-weight:900;letter-spacing:-.04em;font-variant-numeric:tabular-nums;color:#0a0a0a;line-height:1;margin:8px 0 6px;}
.d{font-size:18px;font-weight:700;color:{{$ACCENT}};display:flex;align-items:center;gap:6px;}</style>
<div class="c">
<div class="l">{{$LABEL}}</div>
<div class="n">{{$PREFIX}}{{$VALUE}}{{$SUFFIX}}</div>
<div class="d">↑ {{$DELTA}}% <span style="color:#6b6862;font-weight:400;font-size:14px;margin-left:6px;">vs last quarter</span></div>
</div>`}
  knobs={[
    { name: "LABEL", label: "Label", default: "Monthly recurring revenue" },
    { name: "VALUE", label: "Value", default: "248,400" },
    { name: "PREFIX", label: "Prefix", default: "$" },
    { name: "SUFFIX", label: "Suffix", default: "" },
    { name: "DELTA", label: "Delta %", default: "24.6" },
    { name: "ACCENT", label: "Delta color", type: "color", default: "#1f8a5b" }
  ]}
  height={320}
/>

## Compare: static vs animated

Same data, two treatments. The animated one earns more attention for the same number.
<CompareSlider beforeHtml={`<style>body{margin:0;background:#f6f5f1;font-family:ui-sans-serif,system-ui;height:100vh;display:grid;place-items:center;}.c{background:white;border-radius:18px;padding:36px 44px;border:1px solid #e3dfd3;min-width:340px;}.l{font-size:11px;letter-spacing:.3em;text-transform:uppercase;color:#6b6862;}.n{font-size:84px;font-weight:900;letter-spacing:-.04em;font-variant-numeric:tabular-nums;line-height:1;margin:8px 0 6px;}.d{font-size:16px;color:#1f8a5b;}</style><div class="c"><div class="l">MRR</div><div class="n">$248,400</div><div class="d">↑ 24.6% vs last quarter</div></div>`} after="kpi-card-pop" labelBefore="Static" labelAfter="Animated" caption="The number is the same. The animation is the editorial choice." /> ## A note on "make number go up" A KPI card with a *down* delta is the harder case. The temptation is to soften it visually — gray instead of red, no arrow, smaller font. Don't. If the number is down, the card should make that clear. The whole point of a KPI card is fast read; obscuring the bad news makes the card lie. The respectful design move is to color the delta red, keep the arrow, and let the *next* element in the layout (a callout, a context note) explain why. Honesty > comfort in dashboards. ## When the count-up is wrong A specific case worth flagging: very large numbers count up too slowly. If the value is `$2.4M`, animating from `$0.0M → $2.4M` over a second is fine. If the value is `1,247,392`, animating digit-by-digit looks bizarre. Round during the count-up: animate the rounded value (`1.2M → 1.25M`) and reveal the precise number at the end. This is the same principle as the [easing cheatsheet](/blog/easing-curves-cheatsheet) — pick a representation that fits the moment, not the data. ## Render to MP4 Once the card looks right at the dimensions you need (1920×1080 for hero animations, 1200×630 for OG images, 1080×1080 for social), render. 
Same pipeline as everything else: open the [playground](/playground), pick dimensions, render. See the [use-cases page](/use-cases) for the surfaces these end up on. A KPI card is the smallest unit of revenue marketing. It is the cheapest thing to make well and the most expensive thing to make badly — bad ones get screenshotted and shared internally as "do not do this." The 40 lines above are the cheap, good version. Use them. --- # WebCodecs for deterministic video rendering in 2026 URL: https://hyperframes.video/blog/webcodecs-deterministic-video-2026 Published: 2026-05-13T14:00:00.000Z Tags: engineering, webcodecs, encoding, chromium Author: kira-tanaka I have been writing the same paragraph for three years. It goes: "WebCodecs is interesting, but we still encode out of band with ffmpeg, because the API is too unstable to bet a deterministic pipeline on." I cannot write that paragraph anymore. Chromium 130 shipped enough of the missing pieces that, sometime this spring, the WebCodecs path inside HyperFrames quietly went from "experimental" to "the default we are migrating toward." This post is the long version of why. If you are building a browser-side video pipeline in 2026, the question is no longer *whether* to use WebCodecs. It is *which parts*, on *which platforms*, with *which fallbacks*. ## What actually shipped between 2023 and 2026 The WebCodecs spec stabilized in 2022, but the interesting changes are in the implementation. A short list of things that did not exist three years ago and exist now: - `VideoEncoder` with reliable hardware acceleration on every major platform. - AV1 hardware decode on Apple Silicon (M3+), AMD RDNA 3+, Intel Arc, and any Nvidia card from the 4000 series forward. - AV1 hardware *encode* on the same hardware, with the giant asterisk that the quality is still behind libaom at the same bitrate. - `VideoFrame.copyTo()` with explicit pixel format negotiation. 
You can ask for `I420`, `NV12`, `RGBA`, and the browser will tell you what it can give you without lying. - A `latencyMode: "realtime"` flag that meaningfully changes scheduling. - A `requireHardwareAcceleration` flag that fails fast instead of silently dropping you onto a software encoder. The last one matters more than it sounds. In 2023, you would request "h264" and the browser would happily hand you a software encoder when hardware was unavailable, and you would find out from your CPU graph forty seconds into a render. Now you can demand hardware, and the construction fails synchronously if you cannot have it. That is what an API for serious work looks like. ## The probe Here is the rough shape of what we run on first render of a fresh process. It is the first thing that gets logged. If you are building anything similar, copy this idea before you copy anything else. ```ts async function probeEncoder() { const candidates = [ { codec: "av01.0.05M.08", hardwareAcceleration: "prefer-hardware" as const }, { codec: "avc1.640028", hardwareAcceleration: "prefer-hardware" as const }, { codec: "avc1.640028", hardwareAcceleration: "no-preference" as const }, ]; for (const cfg of candidates) { const support = await VideoEncoder.isConfigSupported({ ...cfg, width: 1920, height: 1080, bitrate: 8_000_000, framerate: 60, }); if (support.supported) return support.config!; } throw new Error("No usable VideoEncoder configuration"); } ``` This snippet does three things people get wrong. It uses fully-qualified codec strings (`avc1.640028` is High@4.0; `avc1.42E01E` is Baseline@3.0 and will absolutely make your gradients banded). It calls `isConfigSupported` instead of trusting the codec name. And it falls through, in order, to progressively less ambitious configurations until something works. ## H.264 vs AV1, in 2026, on real machines I am going to give you numbers, because that is what I always wanted when I was reading posts like this. 
These are from our internal benchmark suite: a 10-second 1080p60 render of the same composition (a chart wipe with subtle motion), encoded at 8 Mbps target bitrate, measured wall-clock from `encoder.encode()` first call to last `output` callback.

| Platform | Codec | Hardware | Encode time | VMAF |
|---|---|---|---|---|
| M3 Pro (macOS 15) | H.264 | VideoToolbox | 1.4s | 92.1 |
| M3 Pro | AV1 | VideoToolbox (M3+) | 2.1s | 94.6 |
| Ryzen 9 7950X + RX 7900 | H.264 | VCN 4.0 | 1.6s | 91.4 |
| Ryzen 9 7950X + RX 7900 | AV1 | VCN 4.0 | 2.4s | 93.8 |
| Linux CI (Intel Xeon, no GPU) | H.264 | software (OpenH264) | 6.8s | 90.2 |
| Linux CI (Intel Xeon, no GPU) | AV1 | software (libaom realtime) | 41.3s | 95.1 |

A few things to take from this table. Hardware AV1 is no longer a science project; on a recent consumer GPU it is faster than software H.264. But on CI runners without GPU acceleration, AV1 is still a non-starter for anything resembling real-time. The VMAF deltas are real but small at this bitrate; AV1 pulls further ahead at lower bitrates (3-4 Mbps), where it is roughly 25-30% more efficient.

The practical recommendation we ended up with: AV1 when the user's machine supports hardware encode, H.264 otherwise. The CDN side has caught up; AV1-in-MP4 (`av01` in an `mp4` container) plays in every browser shipped since mid-2024.

## Why we are migrating the render path

The HyperFrames render pipeline used to be: capture PNG frames from headless Chromium, pipe them to ffmpeg over stdin, ffmpeg encodes. This works, and it is what powers the bulk of production renders today.

The problem is the PNG step. Every frame round-trips through PNG encode in Chromium, PNG decode in ffmpeg, then YUV conversion, then encode. On a 1080p60 5-second render, that's 300 PNG encodes (~12ms each on M3) before any pixel reaches the encoder.
With WebCodecs we can capture a `VideoFrame` directly from an `OffscreenCanvas` (or from a tab via the `CaptureController` API, which finally became stable in Chromium 128) and feed it to `VideoEncoder` with no intermediate PNG. On the same M3 box, that 5-second render drops from 4.8s to 2.1s end-to-end. Roughly half the wall clock came from the PNG round-trip. The catch is that this only helps when capture and encode happen in the same browser context. For our headless-Chromium CLI path, where we pull screenshots from a separate process via CDP, we still pay the PNG cost (or use our raw-pipe mode, covered in [From DOM to MP4](/blog/from-dom-to-mp4)). For the browser-side playground at [/playground](/playground), WebCodecs is now the default path. ## Determinism, the part where I usually get nervous Three years ago I would not have used WebCodecs for anything that needed to produce byte-identical output across machines. Hardware encoders are notoriously slippery: Apple's VideoToolbox and AMD's VCN make slightly different rate-control decisions on the same input. In 2026 the situation is more nuanced. For *bit-exact* output across machines, you still need software encoding. We use `x264` (via WASM) for our reproducibility-critical tests, and the new `requireHardwareAcceleration: "no"` flag finally lets us force the WebCodecs path onto the WASM fallback inside Chromium. That gives us deterministic encode without leaving the browser. For *visually identical* output — same VMAF, same SSIM within tolerance — hardware AV1 is now consistent enough that we have stopped chasing the last few bits. Our CI compares VMAF deltas with a tolerance of 0.05 and we have not seen a real regression in six months. ## Latency: the surprise win The benchmark that surprised me most was real-time encode latency — the time from `encoder.encode(frame)` to the corresponding `output(chunk)` callback. With H.264 on VideoToolbox in `latencyMode: "realtime"`, this is consistently under 4ms. 
That is fast enough for interactive use cases I assumed would need a native app. We are now using WebCodecs for the live preview in the playground. As you drag the scrubber, frames are captured from the offscreen canvas, encoded, and packed into an MSE buffer for playback. The whole loop runs at 60fps on a 2022 MacBook Air. Two years ago this would have required a desktop GPU. ## Browser support reality check Because someone always asks: WebCodecs ships in Chromium (130+), Edge (current), Safari (17.4+), and Firefox (since 130, behind a flag until 131, default since 132). The Firefox encode path is software-only as of this writing, but the decode path uses hardware. If you are writing a public-facing tool and you want WebCodecs to be a happy path on Firefox, you should provide a software-encoder fallback, and you should not pretend AV1 hardware encode exists outside of Chromium and Safari on M3+. That is the honest answer. ## What still does not work I have spent the post being optimistic. Let me end with the open issues, because they are real. - **Color management in WebCodecs is a wreck.** `VideoFrame` carries a color space, but the encoder's handling of BT.709 vs sRGB vs Display P3 is implementation-defined. If your composition uses wide-gamut colors, expect the encoded MP4 to look subtly different on different machines. - **Frame queue limits are stingy on hardware encoders.** VideoToolbox in particular will refuse to accept more than ~16 frames in flight, which means a naive "encode everything then await all" loop will deadlock. You need backpressure. - **Error reporting from hardware encoders is opaque.** You get a generic `EncodingError` and a string. We have built a small library of "what this string actually means" notes that we will probably open-source eventually. - **AV1 keyframe spacing on VCN is buggy** in driver versions before 24.10. Render a long video and you may get a 90-second GOP that nothing seeks well. Force `keyFrameEveryNFrames` and verify. 
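The backpressure point above is easier to see in code. What follows is a minimal sketch, not the HyperFrames implementation: it assumes an encoder that exposes `encodeQueueSize` and fires a `dequeue` event, as `VideoEncoder` does, and it caps in-flight frames at a hypothetical `maxInFlight` of 8, chosen only to sit below the ~16-frame limit mentioned for VideoToolbox. `EncoderLike` and `encodeWithBackpressure` are illustrative names.

```typescript
// Structural stand-in for the parts of VideoEncoder this loop relies on.
interface EncoderLike<F> {
  readonly encodeQueueSize: number;
  encode(frame: F): void;
  flush(): Promise<void>;
  addEventListener(
    type: "dequeue",
    listener: () => void,
    options?: { once?: boolean },
  ): void;
}

// Resolves the next time the encoder reports that its queue has shrunk.
function waitForDequeue<F>(encoder: EncoderLike<F>): Promise<void> {
  return new Promise<void>((resolve) => {
    encoder.addEventListener("dequeue", () => resolve(), { once: true });
  });
}

// Submits frames one at a time, pausing whenever too many are in flight.
// maxInFlight = 8 is an assumed value, not a documented limit.
async function encodeWithBackpressure<F extends { close(): void }>(
  encoder: EncoderLike<F>,
  frames: AsyncIterable<F> | Iterable<F>,
  maxInFlight: number = 8,
): Promise<number> {
  let submitted = 0;
  for await (const frame of frames) {
    // Backpressure: wait until the encoder has drained below the cap.
    while (encoder.encodeQueueSize >= maxInFlight) {
      await waitForDequeue(encoder);
    }
    encoder.encode(frame);
    frame.close(); // release the captured frame once it has been queued
    submitted++;
  }
  await encoder.flush(); // wait for the remaining chunks to be emitted
  return submitted;
}
```

With a real `VideoEncoder`, the `frames` argument would be whatever produces `VideoFrame` objects, for example an async generator wrapping canvas capture. The loop closes each frame right after `encode()` so captured frames do not pile up in memory.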
For most of these I expect another year of incremental fixes and then the situation will be boring, which is the best thing you can say about a low-level API.

## Where to start

If you want to play with this without leaving your browser, the [playground](/playground) renders via WebCodecs by default in Chromium and Safari 17.4+. The "Show export details" panel in the render dialog now displays the codec, hardware/software status, and encode latency per chunk. It is the same panel we use internally to debug encoder regressions.

If you are integrating WebCodecs into your own pipeline, start with the probe pattern above, request `hardwareAcceleration: "prefer-hardware"` rather than requiring hardware outright, and write a backpressure-aware encode loop before you write anything else. Everything beyond that is tuning.

WebCodecs in 2026 is no longer the future. It is the present, and the parts of our render path that still go through ffmpeg are, for the first time, the legacy ones.

---

# Render 10,000 variants overnight

URL: https://hyperframes.video/blog/render-10k-variants-overnight
Published: 2026-05-13T14:00:00.000Z
Tags: infrastructure, ci, scale, batch
Author: hf-team

The first time a customer asked us to render ten thousand video variants, we said yes before we knew how. That was nine months ago. We have shipped that customer's overnight batch every weekday since, plus a half-dozen similar batches for other teams, and the architecture has hardened into something we can describe without hand-waving. I want to spend this post walking through it, with real numbers, because "render at scale" is one of those features that is easy to claim and hard to do correctly.

The setup: a fintech customer running a referral campaign. Each user gets a personalized 12-second video showing their referral count, their dollars earned, their next milestone, and a sign-off card. The variant is parameterized by user ID. They have around 10,000 active referrers.
Every weeknight at 11pm Eastern, our system renders fresh videos for everyone whose stats changed that day — typically 4,000 to 7,000 variants — and uploads them to S3 by 6am. The math: 5,000 variants × 12 seconds × 30 fps = 1.8 million frames per night. At ~10ms per frame on warm Chromium that is around 5 CPU-hours of work. Spread across 32 cores it is ten minutes wall time. In practice the batch takes 35-50 minutes, because the long pole is encoding and upload, not rendering. We will get to those. ## The architecture, briefly The system has four components and they live in this order. **The orchestrator** is a small Node service (the same shape we document for the [GitHub Actions integration](/integrations/github-actions)) that wakes up on a schedule, queries the customer's data warehouse for the delta of users whose stats changed, and enqueues a render job per user into Redis. The job payload is the user's parameter bag: `{ name, referralCount, earnings, milestone, ... }`. The orchestrator does not render anything itself. Its only job is to fan out. **The renderer pool** is a horizontally-scaled fleet of containers each running `hyperframes serve --workers=4`. Each container has four warm Chromium instances and pulls jobs from Redis. For a given job, the renderer fetches the parameter bag, materializes the composition (a single HTML template with substituted values), renders to MP4, uploads to S3, writes the result back to Redis. Average render time per 12s variant: 3.2 seconds. **The encoder layer** is, surprisingly, not a separate service. ffmpeg runs in-process within each renderer worker, fed by frame capture. We tried separating it for theoretical clarity and reverted — the inter-process overhead exceeded the parallelism gain. Encode-while-capture is the right architecture for this scale. **The CDN layer** is plain S3 with CloudFront in front of it. 
We write MP4s with content-hash filenames, set a far-future cache header, and let the orchestrator update a `users/<id>.json` pointer that downstream systems read. Cache invalidation is "don't" — the URL changes when the content changes. ## The single template Every variant in a 10,000-job batch is the same HTML template with different parameters. This is critical for two reasons. The first reason is creative consistency. If you have 10,000 different files, you have 10,000 things that can drift. A change to the template — different easing, larger title, new sign-off — must propagate to every variant in the next batch. With one template, the change is one diff. The second reason is rendering performance. We pre-warm the Chromium instances by loading the template *once* with placeholder parameters, then for each job we update the parameter values in-place via `window.postMessage`. Chromium does not have to reparse the document, reload fonts, or recompile CSS. The warm path is around 1.4 seconds per variant. The cold path (full reload) is around 3 seconds. Here is the template's parameter contract, simplified: ```html <script> window.addEventListener("message", (e) => { if (e.data?.type !== "hf-params") return; const p = e.data.params; document.documentElement.style.setProperty("--ref-count", p.referralCount); document.documentElement.style.setProperty("--earnings", p.earnings); document.getElementById("name").textContent = p.name; document.getElementById("milestone").textContent = p.milestone; window.dispatchEvent(new Event("hf-params-ready")); }); </script> ``` The renderer worker sends the message before starting the seek loop, and waits for `hf-params-ready` to confirm the DOM is updated. Three milliseconds of overhead per variant, versus two seconds of cold reload. Worth it. ## The caching strategy Here is a surprise: about 40% of our nightly batch never actually renders, because we cache aggressively. 
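The cache lookup behind that number starts from a stable key. Here is a minimal sketch, assuming the key is a SHA-256 over the template hash plus a canonical JSON form of the parameter bag. The key-sorting step and the names `canonicalize` and `variantCacheKey` are assumptions for illustration, not the production code.

```typescript
import { createHash } from "node:crypto";

// Hypothetical flat parameter bag, e.g. { name, referralCount, earnings, milestone }.
type ParamBag = Record<string, string | number | boolean | null>;

// Serialize with sorted keys so property order never changes the key.
function canonicalize(params: ParamBag): string {
  const entries = Object.keys(params)
    .sort()
    .map((key) => [key, params[key]]);
  return JSON.stringify(entries);
}

// sha256(template_hash + parameter_bag), hex-encoded.
function variantCacheKey(templateHash: string, params: ParamBag): string {
  return createHash("sha256")
    .update(templateHash)
    .update("\n") // separator so the two inputs cannot run together
    .update(canonicalize(params))
    .digest("hex");
}
```

Sorting keys matters because two bags with the same values in a different property order should hit the same cache entry.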
The cache key for a variant is `sha256(template_hash + parameter_bag)`. When a job comes in, we hash the parameters with the template version and check Redis. If the hash is present, we already rendered this variant — point the user at the existing MP4 and skip the work. You might expect parameter bags to be unique enough that caching never hits. They are not. The fintech customer has many users with `referralCount: 0`, `earnings: 0`, `milestone: "first referral"`. Those users get the same video. We render it once, serve it to all of them. The cache hit rate hovers around 38-44% depending on the day. The lesson generalizes: when you parameterize a video, your parameter space is much smaller than your audience. A 10,000-user campaign might have 6,000 distinct parameter bags. The other 4,000 are duplicates. Cache them. ## The failure modes at scale Three things break when you go from rendering one video to rendering ten thousand. Each one took us a month or two to find. **Memory leaks in Chromium.** A warm Chromium instance, rendering for hours, accumulates memory. Around the 300th variant, RSS creeps past 2GB. Around the 500th, the renderer crashes. The fix is to recycle workers after every N renders — we use N=200. Each recycle costs us ~800ms but prevents a far more expensive crash. The recycle is invisible to the orchestrator because Redis just rebalances pending jobs to other workers. **Thundering herd on S3.** When 5,000 workers all try to upload to the same bucket prefix simultaneously, you can hit S3's per-prefix throughput limits and get 503s. We fixed this by hashing the user ID into the S3 key prefix, distributing writes across hundreds of prefixes. S3 scales by prefix; we just had to give it the prefixes. **Font load failures under load.** Our first deploys fetched fonts from Google Fonts on every render. At 5,000 simultaneous renders, Google Fonts started rate-limiting us and some fonts arrived as 503s. 
The composition rendered with fallback fonts and the customer was unhappy. We now bundle every font as a base64-embedded data URL in the template. The HTML is larger; the failure mode is gone. ## The cost math I want to talk about money because nobody else does, and the math is important for understanding when this approach makes sense. A single 12-second variant on our infrastructure costs about $0.004 in compute. Add storage (negligible at MP4 sizes) and bandwidth (CloudFront at $0.085/GB, average MP4 is ~3MB, so $0.00026/variant). Total: roughly half a cent per variant. At 5,000 variants per night, the batch costs us about $23 in cloud compute and bandwidth. For a campaign that runs every weekday for a month: ~$500 total. Compare to the labor cost of producing 5,000 personalized videos in After Effects: it does not exist as a category because it is impossible. This is the cost structure that changes the question. When personalized video at scale costs half a cent per recipient, marketers stop asking "should we make a video for this campaign" and start asking "should we make a hundred thousand videos for this campaign." The pricing is what unlocks the volume. ## What you need to run this yourself You do not need our infrastructure to render at scale. You need three things. The [developer hub](/developers) has working examples of each. **A queue.** Redis, SQS, anything with at-least-once delivery and a sensible retry model. We use Redis because we already had it; you should use whatever your team operates. **Renderer workers.** Containerized HyperFrames instances with `hyperframes serve --workers=N` — or, if you are already on Vercel, the [Vercel integration](/integrations/vercel) handles the worker fleet for you. Run them on EC2, on Fly, on Kubernetes, on whatever you operate. The renderer is stateless; horizontal scaling is the same as for any web worker. **A storage and distribution layer.** S3 + CloudFront is the obvious answer. 
Cloudflare R2 + Cloudflare CDN is cheaper. Whatever you pick, make sure your filenames are content-hashed so cache invalidation is trivial. The orchestration is the place where every team customizes. Our orchestrator queries Postgres; yours might query Snowflake, Segment, or a SaaS marketing platform. The contract with the renderer is the same: enqueue a parameter bag, get back an MP4 URL. ## The next bottleneck At our current scale, the bottleneck is upload, not render. A 12-second 1080p MP4 is around 3MB. Five thousand variants is 15GB of upload from our render fleet to S3 every night. At our current network bandwidth that takes about 22 minutes — longer than the actual rendering. We are exploring two paths. The first is rendering directly into S3 via multipart upload as the encoder produces bytes. The pipe goes straight from ffmpeg to S3 without ever touching the renderer's local disk. Early prototypes show a 30% latency reduction. We will ship it as `--upload-direct` once the failure modes are characterized. The second is rendering smaller files. Most of our customers' viewers watch on phones; 720p is plenty. We are adding adaptive resolution per variant based on the destination platform. A variant going to email might render at 480p with a CTA to "watch in HD on the website." This is media engineering 101 but we have been late to it. When neither of those is the bottleneck, the next one is going to be content generation — coming up with parameter bags interesting enough to justify the variants. That is a creative problem, and it is the one the agentic loop solves. But that is [a different post](/blog/the-agents-camera). ## A note on observability One thing we underestimated when we started running production batches at this scale is how important observability becomes. When a single render fails, you read the error and fix it. When 1 in 200 of 5,000 renders fails, you cannot read 25 errors and fix them individually. You need aggregated metrics. 
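As a rough illustration of what aggregation means here, the sketch below folds per-job records into a failure rate, a p95 render time, and failure counts by error class. The `JobResult` shape and the error class strings are hypothetical; a real pipeline would emit these through its metrics exporter rather than compute them in one pass.

```typescript
// Hypothetical per-job record; field names are assumptions for illustration.
interface JobResult {
  durationMs: number;
  errorClass?: string; // undefined means the render succeeded
}

interface BatchSummary {
  total: number;
  failed: number;
  failureRate: number; // failed / total, 0 for an empty batch
  p95DurationMs: number; // nearest-rank p95 over all jobs, 0 for an empty batch
  failuresByClass: Record<string, number>;
}

function summarizeBatch(results: JobResult[]): BatchSummary {
  const failuresByClass: Record<string, number> = {};
  let failed = 0;
  for (const r of results) {
    if (r.errorClass !== undefined) {
      failed++;
      failuresByClass[r.errorClass] = (failuresByClass[r.errorClass] ?? 0) + 1;
    }
  }

  // Nearest-rank percentile: the value at 1-based rank ceil(0.95 * n).
  const sorted = results.map((r) => r.durationMs).sort((a, b) => a - b);
  const rank = Math.ceil(0.95 * sorted.length);
  const p95DurationMs = sorted.length === 0 ? 0 : sorted[rank - 1] ?? 0;

  return {
    total: results.length,
    failed,
    failureRate: results.length === 0 ? 0 : failed / results.length,
    p95DurationMs,
    failuresByClass,
  };
}
```

Grouping by error class is what turns 25 individual failures into two or three categories someone can act on.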
Our observability stack for the batch pipeline is, in priority order: a Prometheus exporter on every renderer that ships per-job timing and outcome; a dashboard that breaks failures down by error class; structured JSON logs for every failed render, ingested into a search-friendly store; and a daily Slack digest of the previous night's batch ("4,832 variants rendered in 38 minutes, 12 failed, here are the categories").

The dashboard is the artifact we look at every morning. It has three numbers that matter: total variants in the batch, p95 render time, and failure rate. If any of those drift, we investigate before the customer notices. Most mornings, all three are flat and the page is uninteresting. That is the goal.

## When this approach is wrong

I want to be honest about a category of work where batch rendering at this scale is the wrong answer.

If your videos are bespoke (every one is a different composition, with a different brand, different structure, different timing), you do not have a batch problem. You have a creative production problem. HyperFrames helps with that too, but the leverage is in agentic authoring, not in batch infrastructure. The customers I described in this post all have one template per campaign. They get leverage from variant volume, not from variant diversity.

If your videos need to be long-form (three-minute documentaries, five-minute explainers), the batch math changes. Three minutes at 30fps is 5,400 frames per variant, 15 times the frames of a 12-second variant. Five thousand variants would be 27 million frames. The render cost climbs from a fraction of a cent to several cents per variant, and the wall time of the batch climbs from an hour to most of a day. That is still tractable but it is no longer "overnight." Plan accordingly.

If your videos have heavy 3D, particle systems, or complex shaders, the render time per variant climbs beyond what a pool of CPU workers can absorb. You start to want GPU acceleration. We are working on GPU renderer pools, but they are not what we run in production today.
For the heavy 3D case, render in an AE-class tool and use HyperFrames for the templated overlays.

Until then: yes, you can render ten thousand variants overnight. Yes, the math works. Yes, the failure modes are tractable. The infrastructure has caught up. Now go ship a campaign.

---

# Animate a bar chart from JSON in 10 minutes

URL: https://hyperframes.video/blog/animated-bar-chart-tutorial
Published: 2026-05-13T11:00:00.000Z
Tags: data-viz, bar-chart, tutorial, css
Author: hf-team

A bar chart that animates from zero to its values is the most overused, hardest-to-screw-up data visualization in marketing. Annual review videos use it. Earnings calls use it. Every B2B SaaS launch video uses it. And almost all of them are built in After Effects from a CSV someone pasted into a spreadsheet at 11pm.

There is a faster path: keep the data in JSON, the template in HTML, and the render in CI. The version we will build below is ten minutes of work, three hundred bytes of data, and a clean MP4 at the end.

## The data and the template, separated

The split that matters: a JSON file holds the data, the HTML holds the template. The template is dumb: it reads `window.__DATA__` and lays bars out. The data is dumb: it is a flat array of `{ label, value }`.

```json
[
  { "label": "Q1", "value": 42 },
  { "label": "Q2", "value": 68 },
  { "label": "Q3", "value": 91 },
  { "label": "Q4", "value": 124 }
]
```

This split is what makes the rest scale. You can swap the data without touching the template. You can swap the template without touching the data. The chart is reproducible.

## The render: three sections

The template has three pieces:

1. **Title row.** Headline, subhead, source attribution. Quiet typography.
2. **Bars.** One `<div>` per data point. The bars are horizontal, so width encodes the value: each bar animates from 0 to `(value / max) * 100%`.
3. **Value labels.** A number next to each bar, counting up from 0 to the value as the bar grows.
<CodeTabs tabs={[ { label: "data.json", code: `[ { "label": "Q1", "value": 42 }, { "label": "Q2", "value": 68 }, { "label": "Q3", "value": 91 }, { "label": "Q4", "value": 124 } ]` }, { label: "template.html", code: `<style> .chart { display: grid; gap: 18px; padding: 48px; } .row { display: grid; grid-template-columns: 60px 1fr 80px; gap: 16px; align-items: center; } .bar { height: 28px; background: #ff3b1f; transform-origin: left; transform: scaleX(0); transition: transform 1.4s cubic-bezier(.16,1,.3,1); } .val { font-variant-numeric: tabular-nums; } </style> <div class="chart" id="chart"></div> <script> const data = window.__DATA__; const max = Math.max(...data.map(d => d.value)); const chart = document.getElementById('chart'); data.forEach(d => { const row = document.createElement('div'); row.className = 'row'; row.innerHTML = '<span>' + d.label + '</span><div class="bar" style="width: ' + (d.value/max*100) + '%"></div><span class="val">' + d.value + '</span>'; chart.appendChild(row); }); requestAnimationFrame(() => { chart.querySelectorAll('.bar').forEach(b => b.style.transform = 'scaleX(1)'); }); </script>` }, { label: "Result", html: `<style>body{margin:0;background:#f6f5f1;font-family:ui-sans-serif,system-ui;}.chart{display:grid;gap:18px;padding:48px;}.row{display:grid;grid-template-columns:60px 1fr 80px;gap:16px;align-items:center;}.bar{height:28px;background:#ff3b1f;transform-origin:left;transform:scaleX(0);transition:transform 1.4s cubic-bezier(.16,1,.3,1);}.val{font-variant-numeric:tabular-nums;text-align:right;}</style> <div class="chart"><div class="row"><span>Q1</span><div class="bar" style="width:34%"></div><span class="val">42</span></div><div class="row"><span>Q2</span><div class="bar" style="width:55%"></div><span class="val">68</span></div><div class="row"><span>Q3</span><div class="bar" style="width:73%"></div><span class="val">91</span></div><div class="row"><span>Q4</span><div class="bar" style="width:100%"></div><span 
class="val">124</span></div></div> <script>requestAnimationFrame(()=>{document.querySelectorAll('.bar').forEach(b=>{const w=b.style.width;b.style.transform='scaleX(1)';});});</script>` } ]} caption="Data in JSON, template in HTML, animation in CSS." /> ## The animation: stagger the bars Bars that all animate together read as a single block. Bars that stagger by 80-120ms per row read as a story — the eye lands on Q1, then Q2, then Q3. The stagger is the entire difference between "data" and "narrative." Implement it with a `transition-delay` per row, or — cleaner — set the bar's `transform` after a `setTimeout` indexed by row. Either works. The easing should be `cubic-bezier(.16, 1, .3, 1)` — the [settle curve](/blog/easing-that-looks-like-money) that lands without bouncing. ## The value count-up The number next to each bar should count up from 0 to the final value as the bar grows. There is a temptation to interpolate linearly. Resist; tie the count to the same easing as the bar so the number is "at" the right value while the bar is growing. Practically: on every animation frame, compute `current_value = final_value * eased_t` and update `textContent`. Use `Math.round` for integers, `.toFixed(1)` for decimals. Use `font-variant-numeric: tabular-nums` on the value so digit widths do not dance. ## Going bigger: comparison and time Two extensions worth knowing: - **Comparison.** Two bars per row, color A and color B. The reveal becomes: bar A grows, bar B grows next to it, label appears underneath. - **Time.** Bars that scrub left as new ones appear right. This is harder to do well — see the [stock chart tutorial](/blog/stock-chart-animation) for the windowing trick. Both are the same engine, more numbers. ## Render the MP4 Once the chart looks right, render it. The [playground](/playground) supports importing JSON via `window.__DATA__`; the [Next.js API integration](/integrations/nextjs) takes a POST and returns an MP4. Pick whichever surface you live in. 
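If the chart is rendered frame by frame rather than played live, the stagger and count-up described earlier can be expressed as a pure function of time. This is a minimal sketch under assumptions: `chartFrame` is a hypothetical helper, the 1400ms duration and 100ms stagger follow the values discussed above, and a cubic ease-out stands in for the `cubic-bezier(.16, 1, .3, 1)` curve.

```typescript
interface Datum {
  label: string;
  value: number;
}

interface BarFrame {
  label: string;
  widthPct: number; // final bar width as a percentage of the track
  scaleX: number; // eased growth factor in [0, 1]
  shown: number; // value label to display at this instant
}

// Cubic ease-out; an approximation, not the exact cubic-bezier settle curve.
function easeOutCubic(p: number): number {
  return 1 - Math.pow(1 - p, 3);
}

function clamp01(x: number): number {
  return Math.min(1, Math.max(0, x));
}

// Computes every bar's state at time tMs; the same input always gives the same output.
function chartFrame(
  data: Datum[],
  tMs: number,
  durationMs: number = 1400,
  staggerMs: number = 100,
): BarFrame[] {
  const max = data.reduce((m, d) => Math.max(m, d.value), 0);
  return data.map((d, i) => {
    // Each row starts staggerMs later than the one before it.
    const eased = easeOutCubic(clamp01((tMs - i * staggerMs) / durationMs));
    return {
      label: d.label,
      widthPct: max > 0 ? (d.value / max) * 100 : 0,
      scaleX: eased,
      shown: Math.round(d.value * eased), // count tied to the bar's easing
    };
  });
}
```

A render loop would call `chartFrame(data, t)` for each captured frame and apply `scaleX` and `shown` to the DOM, so the same `t` always yields the same pixels.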
The whole pipeline is reproducible: same JSON, same template, same MP4. If marketing wants a Spanish version, you swap the labels and re-render. If the numbers update, you POST the new JSON. The chart is a build artifact.

That is the deal: ten minutes of code, infinite re-renders, no After Effects.

---

# AI video generation: wire ChatGPT or Claude to an MP4 endpoint

URL: https://hyperframes.video/blog/ai-video-generation-api
Published: 2026-05-12T16:00:00.000Z
Tags: ai, video, api, openai, claude
Author: ren-park

The pitch most "AI video generation" startups make is wrong. The interesting AI video pipeline is not "diffusion model generates frames pixel-by-pixel"; those outputs are unreliable, expensive, and nondeterministic. The interesting pipeline is "LLM writes HTML, renderer turns HTML into video." The LLM generates *code*, which is what LLMs are actually good at.

Here is that pipeline end-to-end: prompt → HTML → MP4.

## Why HTML is the right intermediate

Three reasons HTML beats pixels as the AI's output target:

1. **LLMs are excellent at HTML.** Years of training data; small models can output coherent CSS animations.
2. **HTML is editable.** A human can read the output and fix the one wrong color before rendering.
3. **HTML renders deterministically.** Same HTML, same MP4. Same pixels, every time.

Diffusion-generated frames have none of these. The version you render today is not the version you render tomorrow. The output is not text and you cannot edit it. The LLM/HTML pipeline is the only one that fits an engineering workflow.

## The system prompt

The single most important file in the pipeline. The LLM needs to know:

- The output shape (a single HTML document)
- The animation contract (`addEventListener('hf-seek', ...)`, no `setInterval`)
- The variable system (`{{$VAR}}` placeholders)
- The aspect ratio and duration

A working system prompt, abbreviated:

```
You generate deterministic animation templates as single HTML documents.
Constraints:
- Output a single HTML document. No external assets.
- All animation runs as a pure function of t (the playhead time in seconds).
- Listen for `hf-seek` CustomEvents on window; read e.detail.time.
- Never use setInterval, setTimeout, or requestAnimationFrame for animation.
- Use {{$VAR}} placeholders for any text the user might change.
- The animation should loop within data-duration seconds.

Example structure:
<!doctype html><html data-duration="6" data-aspect="16:9">
<head><style>...</style></head>
<body>...
<script>
  function render(t) { /* mutate DOM based on t */ }
  addEventListener('hf-seek', e => render(e.detail.time));
  render(0);
</script>
</body></html>
```

Both the [OpenAI](/integrations/openai) and Claude integrations ship a longer version of this prompt.

## The route handler

The full pipeline as a single Next.js Route Handler:

```ts
// app/api/ai-render/route.ts
import OpenAI from 'openai';
// SYSTEM_PROMPT, extractHtml, validateHtml, and renderHtmlToMp4 are assumed to be
// imported from your own modules; their import paths are omitted here.

const openai = new OpenAI();

export async function POST(req: Request) {
  const { prompt } = await req.json();

  // Ask the model for a single HTML document.
  const completion = await openai.chat.completions.create({
    model: 'gpt-4o',
    messages: [
      { role: 'system', content: SYSTEM_PROMPT },
      { role: 'user', content: prompt },
    ],
  });

  // message.content can be null, so fall back to an empty string.
  const html = extractHtml(completion.choices[0].message.content ?? '');
  if (!validateHtml(html)) return new Response('Generation failed', { status: 422 });

  const mp4 = await renderHtmlToMp4(html, { width: 1920, height: 1080, duration: 6 });
  return new Response(mp4, { headers: { 'content-type': 'video/mp4' } });
}
```

`extractHtml` pulls the HTML out of the LLM's response (it usually wraps in ```html ... ```). `validateHtml` runs sanity checks: a `<!doctype>` is present, an `hf-seek` listener exists, and the element count is reasonable. We skip the renderer if validation fails and surface the error to the user.

## The result, end to end

What it actually looks like, prompt to pixels:

<CodeTabs tabs={[ { label: "Prompt", code: `"A 6-second title card for a podcast called 'Frame by Frame', episode 42, with the host name Kira Tanaka.
Dark background, orange accent for the episode number."` }, { label: "Generated HTML", code: `<!doctype html><html data-duration="6" data-aspect="16:9"> <head><style> body { margin: 0; background: #0a0a0a; color: white; height: 100vh; display: grid; place-items: center; font-family: ui-sans-serif, system-ui; } .stage { text-align: center; } .show { font-size: 24px; letter-spacing: .3em; opacity: 0; } .ep { font-size: 200px; font-weight: 900; color: #ff3b1f; opacity: 0; transform: translateY(20px); } .title { font-size: 48px; margin-top: 16px; opacity: 0; } .host { font-size: 18px; opacity: 0; margin-top: 24px; } </style></head> <body><div class="stage"> <div class="show" id="s">FRAME BY FRAME</div> <div class="ep" id="e">042</div> <div class="title" id="t">The Art of Determinism</div> <div class="host" id="h">with Kira Tanaka</div> </div> <script> function ease(t){ return 1-Math.pow(1-t,3); } function render(t) { document.getElementById('s').style.opacity = ease(Math.min(1, t/0.6)); var u = ease(Math.max(0, Math.min(1, (t-0.6)/0.8))); var e = document.getElementById('e'); e.style.opacity = u; e.style.transform = 'translateY(' + (1-u)*20 + 'px)'; document.getElementById('t').style.opacity = ease(Math.max(0, Math.min(1, (t-1.4)/0.6))); document.getElementById('h').style.opacity = ease(Math.max(0, Math.min(1, (t-1.9)/0.6))); } addEventListener('hf-seek', e => render(e.detail.time)); render(0); </script></body></html>` }, { label: "Rendered MP4", html: `<style>body{margin:0;background:#0a0a0a;color:white;height:100vh;display:grid;place-items:center;font-family:ui-sans-serif,system-ui;} .s{text-align:center;} .show{font-size:18px;letter-spacing:.3em;opacity:.7;} .ep{font-size:140px;font-weight:900;color:#ff3b1f;line-height:1;margin:8px 0;} .t{font-size:32px;} .h{font-size:14px;opacity:.6;margin-top:16px;}</style> <div class="s"><div class="show">FRAME BY FRAME</div><div class="ep">042</div><div class="t">The Art of Determinism</div><div class="h">with Kira 
Tanaka</div></div>` } ]} caption="A real prompt, the LLM's HTML output, and one frame of the resulting MP4." /> ## The safety net LLMs hallucinate. Two safeguards I run in production: 1. **Validation.** Reject any HTML that fails to compile, lacks `hf-seek`, or exceeds 50KB. Re-prompt with the validation error. 2. **Sandboxing.** Render in an isolated worker with no network access. Even if the LLM emits malicious JS, it has nowhere to send the data. Most "AI fails dramatically" stories are missing these two. With them, the worst case is a render that produces a boring video, not a security incident. ## What this lets you build The interesting downstream surfaces, in order of how much they will surprise you: - **Slack bot that renders MP4 explainers** from any user prompt. - **Internal tools** that turn a Linear ticket into a 6-second customer-facing video. - **End-user features** where the prompt is "explain my dashboard" and the output is a personalized video walkthrough. The [Agents Camera post](/blog/the-agents-camera) covers the broader frame here — agents that emit video are the next interesting product surface, and the LLM-to-HTML pipeline is what makes them economical. ## What still needs human review Three things the LLM is bad at, in 2026: 1. **Brand consistency.** It will pick "an orange" but not "your orange." Pin colors in the system prompt. 2. **Long durations.** 6-second renders are reliable; 30-second renders drift in pacing. 3. **Type hierarchy.** The LLM will use one big font and one small font; getting the middle tier right takes a human pass. Wire the LLM to draft. Have a human edit. Render. Ship. The combined loop is a 10x improvement over either pure-human or pure-LLM workflows. See also: [the OpenAI integration](/integrations/openai) and the [render API surface](/developers) for the SDK details. 
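
For readers who want a starting point, the `extractHtml` and `validateHtml` helpers named in the route handler might look roughly like the sketch below. Both are hypothetical: the post does not show the real implementations, `validationErrors` is an invented name, the 500-element cutoff is an assumed stand-in for "reasonable element count", and string length is used as a rough proxy for the 50KB limit.

```typescript
// Hypothetical helper sketches; the shipped integrations may differ.

const MAX_CHARS = 50 * 1024; // rough proxy for the 50KB ceiling
const MAX_ELEMENTS = 500; // assumed cutoff for a "reasonable" element count

/** Pull the HTML document out of an LLM reply, unwrapping a fenced html block if present. */
function extractHtml(reply: string): string {
  const fence = "`".repeat(3);
  const pattern = new RegExp(fence + "(?:html)?\\s*([\\s\\S]*?)" + fence, "i");
  const match = reply.match(pattern);
  return (match ? match[1] : reply).trim();
}

/** List every rule the HTML breaks, so the messages can be fed back into a re-prompt. */
function validationErrors(html: string): string[] {
  const errors: string[] = [];
  if (!/^\s*<!doctype html/i.test(html)) errors.push("missing <!doctype html>");
  if (!html.includes("hf-seek")) errors.push("no hf-seek listener");
  if (/\b(setInterval|setTimeout|requestAnimationFrame)\s*\(/.test(html)) {
    errors.push("uses a timer API instead of hf-seek");
  }
  if (html.length > MAX_CHARS) errors.push("document is larger than 50KB");
  // Count opening tags only; closing tags and the doctype do not match.
  const elementCount = (html.match(/<[a-z][^>]*>/gi) || []).length;
  if (elementCount > MAX_ELEMENTS) errors.push("too many elements");
  return errors;
}

function validateHtml(html: string): boolean {
  return validationErrors(html).length === 0;
}
```

Returning a list of messages rather than a bare boolean is a design choice made here so that the re-prompt step has something concrete to send back to the model.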
--- # Easing that looks like money URL: https://hyperframes.video/blog/easing-that-looks-like-money Published: 2026-05-12T15:00:00.000Z Tags: design, easing, animation, craft Author: marcus-okafor I have a small, embarrassing test I run on motion design portfolios when I am hiring. I scrub through the reel and ignore the typography. I ignore the color. I ignore the cleverness of the concept. I watch only the easing. Within thirty seconds I have an opinion, and the opinion is rarely wrong. The reason is that easing is the part of motion that takes the longest to develop taste for, and the part that gets defaulted-to the most by people who haven't. Every other element in a composition can be cribbed: a font from a magazine, a palette from Coolors, a layout from a Dribbble shot. Easing is what you bring. It is your handwriting. The good news is that taste is teachable, and easing is one of the small domains where a few specific curves, internalized and reused, can carry the entire personality of your work. I want to spend this post going through six curves that I use professionally, what each one communicates, and why the difference between an okay curve and a great one is what separates an ad from a Super Bowl ad. ## The grammar of an easing curve A cubic Bezier easing curve in CSS has four numbers: `cubic-bezier(x1, y1, x2, y2)`. The curve starts at `(0,0)` and ends at `(1,1)`. The two control points pull the curve toward themselves between the endpoints. Y is "progress through the animation." X is "time elapsed." The slope of the curve at any point is "how fast the animation is moving at that moment." This is technical and not actually how I think about it day-to-day. Day-to-day, every easing curve is one of three shapes: *ease in* (slow start, fast finish — feels like falling), *ease out* (fast start, slow finish — feels like landing), or *ease in-out* (slow on both ends — feels like a sigh). 
Within each shape, the exact numbers determine the personality: tight or loose, springy or weighty, mechanical or organic.

The browser's defaults (`ease`, `ease-in`, `ease-out`, `ease-in-out`) are *fine*. They are the equivalent of Times New Roman: they communicate "I did not have an opinion." A motion designer with opinions does not use them.

## Curve 1: The settle

```css
cubic-bezier(.16, 1, .3, 1)
```

This is the cinematic ease-out I default to for any element that *arrives*. Titles landing, modals appearing, cards rising into view. It spends roughly 80% of its duration on the last 30% of its travel, meaning the element rushes in, then *settles* slowly into place over the long tail.

What it communicates: confidence. The element knows where it is going. It does not overshoot. It does not bounce. It arrives like an actor hitting their mark.

I use this curve more than all the others combined. If you can only learn one, learn this one. Pair it with a duration between 600ms and 900ms for most editorial motion. Shorter feels rushed; longer feels indulgent.

## Curve 2: The launch

```css
cubic-bezier(.7, 0, .84, 0)
```

The opposite of the settle. Slow start, fast finish. I use it for elements that *depart*: exit animations, content scrolling out of frame, transitions where the outgoing element should accelerate as it leaves.

What it communicates: dismissal. The element is gone. It does not linger.

The trap with this curve is using it for entrance animations. The slow start feels hesitant on entrance; the element looks like it is unsure whether to arrive. Save it for exits. Pair it with a slightly shorter duration (300-500ms) because exits should not draw attention to themselves.

## Curve 3: The professional in-out

```css
cubic-bezier(.65, 0, .35, 1)
```

When something both starts and stops, I reach for this. It is a symmetric in-out (its two control points mirror each other), but it is flatter at both ends and steeper through the middle than the browser's `ease-in-out`, which prevents the "rocking horse" feeling you get from softer eases.
What it communicates: control. The element is moving from A to B with purpose. Neither flailing on departure nor crashing on arrival. This is my default for chart animations (a bar growing from 0 to its value), for camera moves (the viewport panning across a scene), for any animation that has a clear start and end and should feel mechanical-but-not-robotic. Duration: 800ms to 1.4s depending on travel distance. ## Curve 4: The bounce-in ```css cubic-bezier(.34, 1.56, .64, 1) ``` Notice the `1.56` — values above 1 push the curve past the endpoint and back. This creates a literal overshoot. The element arrives, goes slightly past, and snaps back. What it communicates: personality. Whimsy, energy, sometimes immaturity. I use this sparingly. A bounce on a corporate title sequence feels childish. A bounce on a consumer app feature reveal feels delightful. Read the brand before reaching for it. When you do use a bounce, pair it with a short duration (500-700ms) and a small overshoot amount. A large overshoot feels desperate; a small one feels intentional. The number `1.56` is my standard; `1.7+` veers into novelty. ## Curve 5: The anticipate-and-strike ```css cubic-bezier(.7, -.5, .4, 1.4) ``` Both control points are outside the (0,1) range, creating an S-curve that dips below and overshoots above. The element first moves *backward* slightly, then springs forward past its target, then settles. What it communicates: muscularity, drama, *attention*. This is the curve I use for the single most important reveal in a composition — the headline of a hero shot, the answer in an explainer, the punchline of an ad. It is the visual equivalent of a director's pause before the reveal. It is also a curve to use exactly once per composition. Twice is too many. The whole point is that it draws the eye; if the eye is drawn three times, the eye gets bored. Duration matters here: 700-1000ms is the window. Shorter and the anticipation does not read; longer and it feels theatrical. 
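
To see the dip and the overshoot as numbers rather than adjectives, the progress values of a curve can be sampled directly. This is a small sketch, with `progressAt` and `progressRange` as hypothetical helper names; it relies on the fact that the x control values only remap time, so the lowest and highest progress a curve reaches depend on `y1` and `y2` alone.

```typescript
// Sketch only: helper names are illustrative.

/** Progress (the y component) of cubic-bezier(_, y1, _, y2) at curve parameter s in [0, 1]. */
function progressAt(y1: number, y2: number, s: number): number {
  const r = 1 - s;
  return 3 * y1 * s * r * r + 3 * y2 * s * s * r + s * s * s;
}

/** Lowest and highest progress the curve reaches: below 0 is anticipation, above 1 is overshoot. */
function progressRange(y1: number, y2: number, samples = 1000): { min: number; max: number } {
  let min = Infinity;
  let max = -Infinity;
  for (let i = 0; i <= samples; i++) {
    const y = progressAt(y1, y2, i / samples);
    if (y < min) min = y;
    if (y > max) max = y;
  }
  return { min, max };
}

const anticipateRange = progressRange(-0.5, 1.4); // cubic-bezier(.7, -.5, .4, 1.4)
const bounceRange = progressRange(1.56, 1); // cubic-bezier(.34, 1.56, .64, 1)
const settleRange = progressRange(1, 1); // cubic-bezier(.16, 1, .3, 1)
```

For the anticipate-and-strike curve, `min` falls below 0 and `max` rises above 1; the bounce-in only rises above 1; the settle stays inside the 0 to 1 band. Subtracting 1 from `max` gives the overshoot as a fraction of the travel distance, which is a convenient number to tune against.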
## Curve 6: The whisper ```css cubic-bezier(.4, 0, .2, 1) ``` This is the gentlest ease in my regular rotation. Material Design has used variations of it as their default for years, and there is a reason: it works for almost everything, communicates almost nothing, and gets out of the way. What it communicates: nothing in particular. It is the no-opinion ease, but the *good* version of no-opinion. When I want motion to be present but not commented-upon — background gradient drift, subtle scale on hover, image filters fading in — I reach for the whisper. The trap is using only this curve. A reel of compositions using only the whisper is technically competent and emotionally flat. The whisper is the supporting cast, not the lead. ## What an easing curve is doing for the brain A digression that I find useful when teaching this. The reason easing works at all is that the human visual system is exquisitely tuned for predicting motion. We evolved to track prey and predators; our brains run a real-time forward model of "where will this thing be in 100ms." When motion confirms the model, the motion feels *natural*. When motion violates the model, the brain flags it as worth attention. Linear motion (no easing) violates the model in a small, persistent way: real-world objects do not move at constant velocity. They accelerate and decelerate. Linear motion looks mechanical because it is mechanical. Cinematic easing — the settle, the professional in-out — confirms the model. The eye expects deceleration as an object approaches its rest position, and gets it. The motion feels natural even though it is, mathematically, just as artificial as linear. The dramatic curves — anticipate-and-strike, bounce — *deliberately* violate the model. The brain flags them. Attention is captured. This is why a bouncy entrance pulls the eye and a linear entrance fades into the background. The motion designer is exploiting a perceptual bug for editorial effect. 
## The number that fixes everything: duration A confession. I have spent more time tuning durations than tuning curves. The curve sets the shape of the motion; the duration sets its *weight*. A settle at 400ms feels punchy and energetic. The same settle at 1200ms feels lavish and indulgent. The same settle at 2000ms feels like the composition has stopped working. My ranges, by category: - Entrances of small elements (captions, lower thirds): 400-700ms - Entrances of headlines: 700-1100ms - Camera moves and pans: 1000-2000ms - Backdrop drifts: 4000-8000ms (very slow on purpose) - Exits of anything: 200-400ms (always faster than the entrance) The rule I keep coming back to: exits should be 50-70% the duration of entrances. The viewer has already seen the element; they do not need time to read it on the way out. Fast exits feel snappy; slow exits feel like the composition is dragging. ## Putting it together Here is the pattern I teach motion designers I am bringing onto HyperFrames. Pick one curve per composition for the bulk of the work — usually the settle, occasionally the professional in-out. Pick one curve for the hero moment — usually the anticipate-and-strike. Use the whisper for backgrounds. Use the launch for exits. Maybe deploy a bounce somewhere for a single charming detail. That is five curves total in a 30-second composition. Five curves, six durations, and the entire personality of the piece emerges. (If you are coming from a timeline tool, the [After Effects comparison](/compare/after-effects) covers how this maps to graph-editor habits.) The "looks like money" feeling is the cumulative effect of every one of those choices being deliberate. You do not need a library. You do not need a preset pack. You need to type these six numbers into a CSS file, and to *feel* the difference each one makes. Open a composition in the [HyperFrames playground](/playground). Replace `ease-out` with `cubic-bezier(.16, 1, .3, 1)`. Render. Watch. Feel. 
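
One way to keep these choices deliberate is to collect the six curves and the exit rule in a single small module. The sketch below is an assumption about how that could be organized, not part of HyperFrames: `EASINGS`, `exitDurationMs`, and `animationValue` are hypothetical names, and the 0.6 default is simply the middle of the 50-70% band.

```typescript
// Sketch only: names and the default ratio are illustrative assumptions.

const EASINGS = {
  settle: "cubic-bezier(.16, 1, .3, 1)",
  launch: "cubic-bezier(.7, 0, .84, 0)",
  inOut: "cubic-bezier(.65, 0, .35, 1)",
  bounceIn: "cubic-bezier(.34, 1.56, .64, 1)",
  anticipate: "cubic-bezier(.7, -.5, .4, 1.4)",
  whisper: "cubic-bezier(.4, 0, .2, 1)",
} as const;

type EasingName = keyof typeof EASINGS;

/** Exit duration derived from an entrance duration; the ratio is clamped to the 50-70% band. */
function exitDurationMs(entranceMs: number, ratio = 0.6): number {
  const clamped = Math.min(0.7, Math.max(0.5, ratio));
  return Math.round(entranceMs * clamped);
}

/** Build a CSS `animation` shorthand value: name, duration, easing, delay, fill mode. */
function animationValue(name: string, easing: EasingName, durationMs: number, delayMs = 0): string {
  return `${name} ${durationMs}ms ${EASINGS[easing]} ${delayMs}ms backwards`;
}
```

With this in place, a headline entrance could be `animationValue("rise", "settle", 900, 200)` and its exit `animationValue("leave", "launch", exitDurationMs(900))`, where `rise` and `leave` are keyframe names you would define yourself.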
## How to develop the ear A practical exercise I give people who want to develop taste in easing: pick three motion designers whose work you admire. For each, watch ten seconds of one of their pieces, frame-by-frame, scrubbing. Pay attention to nothing but the easing. After two or three watches, try to write down — in cubic-bezier numbers — what easing they used. You will be wrong, the first ten times. By the twentieth, you will be in the right neighborhood. By the fiftieth, you will be naming the curve before you finish the first viewing. This is the same way ear training works for musicians: repeated exposure, deliberate attention, gradual calibration. The curve I most often see junior designers misidentify is the difference between a settle (`cubic-bezier(.16, 1, .3, 1)`) and a Material-style standard ease (`cubic-bezier(.4, 0, .2, 1)`). They look similar in a thumbnail. They feel completely different at 60fps in motion. The settle hangs longer at the end; the Material curve resolves more cleanly. One is editorial; the other is product. Knowing which one is which, by feel, is most of the work. ## The curve as signature A final point I want to leave you with. Every motion designer I respect has a signature curve. It is not their *only* curve, but it is the one they reach for first, the one they default to when nothing else suggests itself. You can recognize their work, sometimes, by the easing alone. Mine is the settle, `cubic-bezier(.16, 1, .3, 1)`. I have used it on every project I have shipped this year. It is the curve that, to my eye, communicates the kind of confidence I want my work to have. Yours might be different — a tighter settle, a more dramatic anticipate-and-strike, a perfectly tuned Material curve. The point is to have one. To know it. To use it deliberately. The work that looks like money is the work that knows what it is doing with every number. Start with the curves. The rest follows. Now you know. 
--- # How to generate TikTok videos from a template (the engineering way) URL: https://hyperframes.video/blog/tiktok-video-from-template Published: 2026-05-12T09:00:00.000Z Tags: tiktok, social, automation, 9:16, tutorial Author: ren-park If you are running a creator program, an affiliate funnel, or any campaign that needs more TikTok output than three editors can ship, the bottleneck is never ideas. It is the eight clicks per variant in CapCut. Move the template into HTML, render the variants from a CSV, and the throughput problem becomes a throughput non-problem. This is the engineering build of "make a TikTok." It assumes the format (9:16, ~30s), the safe areas (TikTok's bottom UI eats 12% of the canvas), and the deterministic output that lets you batch a hundred renders without watching any of them. ## The TikTok canvas, technically A vertical TikTok is `1080 × 1920` at 30 or 60fps. The platform overlays its own UI — caption, username, action rail — over your video. The areas that get covered: - **Bottom 240px**: caption + username. Anything load-bearing here will be hidden. - **Right 180px, lower half**: like / comment / share / sound rail. - **Top 120px**: "For You / Following" tabs (mostly transparent, but viewers' eyes flick there). Design the safe zone as the middle 720px wide × 1440px tall block, centered horizontally, biased slightly upward. 
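
The overlay measurements above can be turned into a rectangle that templates check against. This is a minimal sketch under one interpretation of "biased slightly upward": the block is centered inside the band left uncovered by the top and bottom overlays, which puts its center above the canvas midpoint. `safeZone` and `fitsSafeZone` are hypothetical helper names.

```typescript
// Sketch only: helper names and the centering rule are illustrative assumptions.

interface Rect {
  x: number;
  y: number;
  width: number;
  height: number;
}

const CANVAS = { width: 1080, height: 1920 };
const OVERLAYS = { top: 120, bottom: 240, right: 180 }; // pixels covered by TikTok UI

/** Safe-zone rectangle: centered horizontally, centered vertically between the overlays. */
function safeZone(width = 720, height = 1440): Rect {
  const x = (CANVAS.width - width) / 2;
  const bandTop = OVERLAYS.top;
  const bandBottom = CANVAS.height - OVERLAYS.bottom;
  const y = bandTop + (bandBottom - bandTop - height) / 2;
  return { x, y, width, height };
}

/** True when the rectangle lies entirely inside the safe zone. */
function fitsSafeZone(r: Rect, zone: Rect = safeZone()): boolean {
  return (
    r.x >= zone.x &&
    r.y >= zone.y &&
    r.x + r.width <= zone.x + zone.width &&
    r.y + r.height <= zone.y + zone.height
  );
}
```

With the default size, the zone's right edge lands at 900px, exactly where the 180px action rail begins, and its bottom edge at 1620px, above the 240px caption area.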
<InlineSandbox html={`<!doctype html> <html><body style="margin:0;background:#0a0a0a;display:grid;place-items:center;min-height:100vh;font-family:ui-sans-serif,system-ui;"> <div style="position:relative;width:270px;height:480px;background:linear-gradient(160deg,#ff3b1f,#ff6a4a 60%,#ffb800);border-radius:18px;overflow:hidden;color:#fff;"> <div style="position:absolute;top:0;left:0;right:0;height:30px;background:rgba(0,0,0,.25);display:flex;align-items:center;justify-content:center;font-size:10px;letter-spacing:.2em;text-transform:uppercase;opacity:.6;">FOR YOU · FOLLOWING</div> <div style="position:absolute;inset:30px 18px 60px 18px;outline:2px dashed rgba(255,255,255,.55);border-radius:14px;display:grid;place-items:center;text-align:center;padding:24px;"> <div><div style="font-weight:800;font-size:32px;letter-spacing:-.02em;line-height:1.05;">SAFE<br/>ZONE</div><div style="margin-top:8px;font-size:11px;opacity:.8;letter-spacing:.2em;text-transform:uppercase;">720 × 1440</div></div> </div> <div style="position:absolute;right:6px;bottom:80px;display:grid;gap:14px;"> <div style="width:28px;height:28px;border-radius:50%;background:rgba(255,255,255,.18);"></div> <div style="width:28px;height:28px;border-radius:50%;background:rgba(255,255,255,.18);"></div> <div style="width:28px;height:28px;border-radius:50%;background:rgba(255,255,255,.18);"></div> </div> <div style="position:absolute;left:14px;right:50px;bottom:14px;"> <div style="font-weight:700;font-size:12px;">@yourbrand</div> <div style="font-size:11px;opacity:.85;margin-top:2px;line-height:1.3;">Caption goes here. The line wraps. Maybe a hashtag. #fyp</div> </div> </div> </body></html>`} height={520} caption="The 9:16 canvas with TikTok's UI overlays mapped. The dashed area is what's actually seen." /> ## The template structure A reusable TikTok template has four moving parts: 1. **A hook frame** (0.0–1.2s) — big text, instantly readable, sets up the payoff. 2. **A reveal block** (1.2–6s) — the content. 
Could be a chart, a product shot, a quote. 3. **A CTA frame** (6–8s) — "link in bio," account handle, one image. 4. **A subtle loop hint** — the last frame matches the first, so the algorithmically-driven replay feels seamless. That last point is underrated. TikTok auto-loops videos, and a video that loops invisibly gets a 30–60% watch-time lift in our (informal) testing. ## Variables a TikTok template should expose Keep the variable list short. The marketer touches four things: - **Hook text** ("3 reasons we shut down our Discord") - **Body text or image asset** - **Brand color** - **CTA handle** Lock everything else — easing, type scale, safe zone padding — in the template. Variability should live in the data, not the design. <VariableKnobs html={`<style>body{margin:0;background:#000;height:480px;display:grid;place-items:center;font-family:ui-sans-serif,system-ui;} .canvas{position:relative;width:270px;height:480px;background:{{$BG}};color:#fff;overflow:hidden;border-radius:18px;} .hook{position:absolute;inset:60px 24px auto 24px;font-weight:800;font-size:30px;letter-spacing:-.02em;line-height:1.05;text-shadow:0 2px 8px rgba(0,0,0,.4);} .cta{position:absolute;left:14px;right:50px;bottom:80px;background:rgba(0,0,0,.65);backdrop-filter:blur(8px);padding:10px 12px;border-radius:10px;} .cta b{font-weight:700;font-size:13px;} .cta s{display:block;font-weight:400;font-size:11px;opacity:.85;text-decoration:none;margin-top:2px;} .dot{position:absolute;top:24px;left:24px;width:8px;height:8px;border-radius:50%;background:{{$ACCENT}};animation:p 1s ease-in-out infinite;} @keyframes p{50%{transform:scale(1.6);opacity:.6;}} </style> <div class="canvas"> <div class="dot"></div> <div class="hook">{{$HOOK}}</div> <div class="cta"><b>@{{$HANDLE}}</b><s>{{$CTA}}</s></div> </div>`} knobs={[ { name: "HOOK", label: "Hook text", default: "3 reasons we deleted our Discord" }, { name: "HANDLE", label: "Handle", default: "hyperframes" }, { name: "CTA", label: "CTA line", default: "Full 
breakdown — link in bio" }, { name: "BG", label: "Background", type: "color", default: "#1a0030" }, { name: "ACCENT", label: "Accent dot", type: "color", default: "#ff3b1f" } ]} height={520} /> ## Batch rendering from a CSV The whole point of moving to code is this step. Drop a CSV with columns matching your template variables — `hook,handle,cta,bg,accent` — and render one MP4 per row. See [batch-personalized videos from CSV](/blog/batch-personalized-videos-from-csv) for the exact pipeline. A reasonable cadence: ten templates × twenty CSV rows = 200 unique TikToks per upload session. The marketer picks the top 30 from a thumbnail sheet and queues them through your scheduler. ## Why this beats CapCut, eventually CapCut wins on the first ten videos. It loses on the hundredth because every video is a separate file with its own state. A code template treats those hundred videos as data — change the brand color across all of them in one commit, re-render overnight, ship in the morning. The dividing line is usually around your fourth weekly batch. By the time you are doing this every Friday, the [render pipeline](/tools/html-to-video) saves a workday per week. ## Beyond TikTok The same template, with the safe-zone block moved, becomes an [Instagram Reels](/blog/instagram-reels-automation) video or a [YouTube Short](/blog/youtube-shorts-generator). Render three aspect ratios off the same source and the platform decision becomes a config flag, not a re-edit. [Open the playground](/playground), build the first one, then schedule the batch. --- # Confetti, sparkles, and other particle tricks in pure CSS URL: https://hyperframes.video/blog/css-confetti-particle-effect Published: 2026-05-11T14:00:00.000Z Tags: css, particles, vfx, tutorial Author: hf-team The most over-engineered animation in the modern web is confetti. There is a library for it. There are several libraries for it. 
None of them are wrong, but you do not actually need any of them: a 200-particle confetti burst is forty lines of plain code, and the version you write yourself looks better because you tuned it.

We will walk through a confetti burst, then generalize to sparkles, snow, and "money rain." The same engine drives all four.

## The data model

Every particle has the same five properties: position `(x, y)`, velocity `(vx, vy)`, rotation `rot`, rotational velocity `vr`, and a color. Build an array of 200 of these, seeded randomly. That is the entire state.

Then on each frame, you do not actually re-simulate anything; you compute the particle's position *as a pure function of time*. Position at time `t` is `start + velocity * t + 0.5 * gravity * t * t`, with gravity applied to `y` only. Rotation is `start_rot + vr * t`. This is what makes the animation deterministic: every frame is independent of every other frame.

## Why a pure function of time?

Most particle systems update state every frame: `p.x += p.vx * dt`. This works until you want to render to MP4; then small dt variations accumulate into visible drift between renders. A pure function of time produces the same image at the same `t`, every time. Render-determinism falls out for free.

This is the same principle behind [our deterministic rendering manifesto](/blog/deterministic-video-manifesto): never integrate state, always compute from a clock. If you want a different burst, change the seed. If you want the same burst tomorrow, you get it.

## The three knobs that matter

Once the engine works, the three numbers that change the *feel* are:

1. **Particle count.** 50 reads as "splash," 200 as "confetti," 600 as "explosion." Pick the one that matches your moment.
2. **Gravity.** Real-world gravity is about 980 cm/s², which is 980 px/s² if you treat one pixel as one centimeter. But video time is compressed; try 600-900 for a confetti shower, 1500+ for a "drop."
3. **Initial velocity spread.** Wider spread means more chaos. For a centered burst, a uniform radial spread feels right. For a directional flick, bias the velocity.
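
The data model and the pure-function rule can be sketched in a few typed functions. This is an illustration under stated assumptions rather than the engine itself: `makeRng`, `makeParticles`, and `particleAt` are hypothetical names, the velocity ranges are arbitrary starting values, and the seeded generator is a simple linear congruential one.

```typescript
// Sketch only: names, seed handling, and velocity ranges are illustrative assumptions.

interface Particle {
  x: number; // start position (px)
  y: number;
  vx: number; // initial velocity (px/s)
  vy: number;
  rot: number; // start rotation (radians)
  vr: number; // rotational velocity (radians/s)
  color: string;
}

const COLORS = ["#ff3b1f", "#ffb800", "#1f8a5b", "#2b66ff", "#ff6a4a"];

/** Small seeded generator, so the same seed always produces the same burst. */
function makeRng(seed: number): () => number {
  let s = seed;
  return () => {
    s = (s * 9301 + 49297) % 233280;
    return s / 233280;
  };
}

function makeParticles(count: number, seed: number, originX: number, originY: number): Particle[] {
  const rand = makeRng(seed);
  const particles: Particle[] = [];
  for (let i = 0; i < count; i++) {
    particles.push({
      x: originX,
      y: originY,
      vx: (rand() - 0.5) * 900,
      vy: (rand() - 0.9) * 700, // mostly upward; canvas y grows downward
      rot: rand() * Math.PI * 2,
      vr: (rand() - 0.5) * 12,
      color: COLORS[Math.floor(rand() * COLORS.length)],
    });
  }
  return particles;
}

/** Where a particle is at time t (seconds). No state is carried between frames. */
function particleAt(p: Particle, t: number, gravity: number): { x: number; y: number; rot: number } {
  return {
    x: p.x + p.vx * t,
    y: p.y + p.vy * t + 0.5 * gravity * t * t,
    rot: p.rot + p.vr * t,
  };
}
```

A draw loop would call `particleAt` for every particle at the current playhead time and paint the result; because nothing is accumulated, frame 90 can be rendered without ever rendering frames 0 through 89.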
<VariableKnobs html={`<style>body{margin:0;background:#f6f5f1;height:100vh;display:grid;place-items:center;font-family:ui-sans-serif,system-ui;} .msg{font-size:64px;font-weight:800;letter-spacing:-.04em;} canvas{position:absolute;inset:0;pointer-events:none;}</style> <div class="msg">🎉 {{$MSG}}</div> <canvas id="c"></canvas> <script> var c=document.getElementById('c');var ctx=c.getContext('2d'); function fit(){c.width=innerWidth;c.height=innerHeight;}fit();addEventListener('resize',fit); var COLORS=['#ff3b1f','#ffb800','#1f8a5b','#2b66ff','#ff6a4a']; var N={{$COUNT}};var G={{$GRAVITY}}; var parts=[];var s=1;function r(){s=(s*9301+49297)%233280;return s/233280;} for(var i=0;i<N;i++)parts.push({x:c.width/2,y:c.height/2,vx:(r()-.5)*900,vy:(r()-.9)*700,rot:r()*Math.PI*2,vr:(r()-.5)*12,size:8+r()*8,color:COLORS[Math.floor(r()*COLORS.length)]}); function render(t){ctx.clearRect(0,0,c.width,c.height);parts.forEach(function(p){var x=p.x+p.vx*t;var y=p.y+p.vy*t+0.5*G*t*t;var rot=p.rot+p.vr*t;ctx.save();ctx.translate(x,y);ctx.rotate(rot);ctx.fillStyle=p.color;ctx.fillRect(-p.size/2,-p.size/4,p.size,p.size/2);ctx.restore();});} var tt=0;function loop(){tt=(tt+0.016)%5;render(tt);requestAnimationFrame(loop);}loop(); </script>`} knobs={[ { name: "MSG", label: "Headline", default: "Shipped!" }, { name: "COUNT", label: "Particle count", type: "number", default: "200", min: 20, max: 600, step: 20 }, { name: "GRAVITY", label: "Gravity", type: "number", default: "800", min: 100, max: 2000, step: 50 } ]} /> ## Generalizing: sparkles and snow Same engine, different parameters: - **Sparkles:** gravity 0, lifetime 1.5s, opacity fades in then out. Spawn continuously rather than as a burst. - **Snow:** gravity 100, wind term (small `sin(t * freq)` offset on `x`), opacity 0.7, size variance is high. - **Money rain:** rectangles with `4:8` aspect, gravity 1200, no rotation drift (bills fall flat), low color variance. Each is a tweak to two or three numbers in the same loop. 
If you find yourself writing a new particle engine for each effect, you are over-engineering. ## What canvas gives you that DOM does not 200 absolutely-positioned `<div>` elements *will* render confetti. They will also drop frames at high counts, especially on mobile. A single canvas with 200 fills runs at 60fps on a five-year-old phone. The rule of thumb: if you have more than ~50 moving elements, switch to canvas. The API is two function calls (`fillRect`, `translate/rotate`) and you get back orders-of-magnitude headroom. ## Pairing confetti with a payoff A confetti burst with no anchor reads as noise. Always pair it with a payoff in the center — the headline, the number, the brand mark. The confetti is the punctuation; the payoff is the sentence. See also our notes on [motion graphics in 80 lines](/blog/motion-graphics-in-80-lines) for more on this principle. ## Rendering to MP4 Because the simulation is a pure function of `t`, rendering to MP4 is the same as rendering to the screen — just capture each frame at fixed `t` intervals and encode. The [HyperFrames render pipeline](/tools/html-to-video) does exactly this; you get a 30fps MP4 with no frame drops and no dt drift. The whole point: confetti is forty lines, you understand the math, you can tune the feel, and the output is identical every time you press render. That is the deal. --- # Git for video URL: https://hyperframes.video/blog/git-for-video Published: 2026-05-11T13:30:00.000Z Tags: workflow, version-control, git, collaboration Author: ren-park There is a category of tool that promises "version control for designers" every few years. Abstract had a moment. Plant had a moment. Figma's version history. Frame.io's review system. Each of these solves a real problem and stops short of the thing software engineers have had since 2005. They give you *snapshots*, not *history*. You can roll back, but you cannot diff. You can comment, but you cannot blame. HyperFrames compositions are HTML. HTML is text. 
Text fits in git. This sounds like a small technical detail until you have spent six months using it. Then it sounds like the most important feature the tool has. I want to spend this post walking through what git-for-video actually looks like in practice — what branches mean, what code review of a motion design looks like, what the rollback button does — and why the alternative tools cannot do these things even when they want to. ## The shape of the problem Motion design has a collaboration problem nobody likes to talk about. Three designers on one project. Each one has a copy of the AE file. They make changes in parallel. At some point, somebody has to *merge* their changes. There is no merge. There is one person opening four versions of the file and copy-pasting layers, hoping nothing breaks. I have watched senior motion designers spend half a day reconciling versions of a 45-second spot. The agency I worked at before HyperFrames had a rule: only one person could touch the project file at a time. The rule was enforced by Slack message. Sometimes it broke. This is not a tool problem. It is an *artifact* problem. Binary files do not merge. Three-way merge requires the merge algorithm to understand structure, and AE projects are an opaque blob from git's perspective. The best git can do is "use mine" or "use theirs," which is not version control. It is a lottery. When the composition is HTML, git's three-way merge works. Two designers can edit different sections of the same file and git stitches the changes together. Conflicts happen, and conflicts are resolvable, because git can see what each side changed. The whole class of "I lost two hours of work because Slack said it was my turn but I didn't see the message" goes away. ## What a meaningful diff looks like Let us go through a real one. A designer on my team made a change to a hero composition last week. 
Here is the relevant slice of the diff:

```diff
 .title {
   font-size: 96px;
-  font-weight: 500;
+  font-weight: 600;
   letter-spacing: -0.02em;
-  animation: rise 700ms cubic-bezier(.2,.7,.1,1) 200ms backwards;
+  animation: rise 900ms cubic-bezier(.16,1,.3,1) 200ms backwards;
 }
```

Four lines of change. Two changes. The font-weight got heavier. The easing got more cinematic and a bit longer.

When this diff hit my pull request inbox, I could read the *intent*: "make the title land harder." I left a comment on line 4. "Let's try 240ms delay instead of 200ms; it gives the backdrop a beat more time to settle." She replied. We went back and forth twice. We merged. The whole exchange happened in GitHub, in async, with both of us looking at the same numbers.

Try doing this in After Effects. There is no review interface. There is no inline comment. There is no way to say "let's try 240ms instead of 200ms" without opening the file, finding the keyframe, recording a screen capture, and posting it in Slack. The medium of conversation has to change every time, and every change loses fidelity.

## Branches as variants

Here is where it gets interesting. In git, branches are cheap. You can have a hundred branches. Most exist for a day. They are how you experiment.

For video, this maps to *variants*. We have a campaign coming up, a fintech launch, and the brief calls for fifteen variants of the same 15-second spot. Same structure, different copy, different colors per market, different durations for some platforms. The traditional approach is fifteen AE files in fifteen folders.

The HyperFrames approach is one main composition and fifteen branches off it. Each branch differs from main by a small diff: this line of copy, this color token, this duration. When we discover an improvement in one branch, we cherry-pick the commit back to main. When main improves (a new easing curve, a better caption position), every variant rebases and inherits the improvement.
The discipline is the same as software branching: keep diffs small, merge often, name branches descriptively. The result is a campaign that ships as a coherent set, not as fifteen drifted copies that each have a different version of "the same" thing. ## What code review of a motion design feels like I want to describe this concretely because it surprised me how natural it became. A PR lands in our queue. The title is "Tighten the title rise easing on hero composition." I open the PR. GitHub shows me the diff: three lines. I open the preview link in the PR description — every PR has a deploy preview that renders the composition, wired up through the [GitHub Actions integration](/integrations/github-actions) and posted from a [Vercel preview deploy](/integrations/vercel). I scrub through. I watch the before-and-after on the same screen. I leave a single comment on the line that changed: "Yes, much better. Approving." The whole interaction takes ninety seconds. The reviewer (me) and the author (a colleague) never get on a call. There is a permanent record. The merged commit is forever attributable. Compare to a typical motion design review. The designer exports the file as an MP4. They DM it to the reviewer. The reviewer watches it. They reply with a paragraph of feedback. The designer interprets the paragraph, makes changes, exports again, DMs again. Three rounds later, the file ships. Nobody remembers exactly what was reviewed. The MP4 link expires. There is no audit trail. The first time I shipped a motion design change as a pull request and saw it go through code review, I realized this was the missing piece. Motion design has been a craft without process. Process is what makes work scale beyond one designer. ## Rollback as a verb There is a feature in HyperFrames I use approximately weekly. I do not advertise it because it is so boring that nobody asks about it. I will advertise it now. 
`git checkout HEAD~3 -- composition.html && hyperframes render` That command reverts the composition to three commits ago, renders it, and writes a new MP4. Total time, about ten seconds. The client said "can we see what it looked like before yesterday's change?" I have an answer in ten seconds. Not "let me find the old project file." Not "let me re-export from the backup." A real answer, derived from real history, with the actual change reverted. The MP4 output has a sidecar manifest that records the commit hash. Two weeks from now, when someone asks "where did this MP4 come from?", the manifest answers. We know the exact source. We can regenerate. This is the boring superpower of putting compositions in git: *every render is reproducible from its source*. The MP4 is never the source of truth. The commit is. When the MP4 is lost, deleted, corrupted, or out of date, we generate a new one. When the source is lost, we have not really lost it either, because it is on every machine that has the repo. ## What the other tools cannot do I want to address the alternatives directly. Figma's version history is excellent for static design. It is a snapshot system. It does not produce diffs. You cannot branch a Figma file in a way that allows merge. You cannot review a Figma change as a pull request. The history is a list of named snapshots, not a graph of commits. Frame.io's review system is excellent for client review. You upload an MP4 and stakeholders comment with frame-level timestamps. It is a great commenting tool. It is not version control. The artifact under review is an export, not the source. When the client says "go back to the version from Tuesday," somebody has to find Tuesday's project file. Abstract (now defunct) tried to be git-for-Sketch and almost made it. The hard problem they did not solve was three-way merge of binary Sketch files. The XML-extracted diffs they showed in review were a hint of what was possible but not a complete answer. 
The tool died, in part, because the artifact format never gave them the leverage they needed. (For a longer take on the timeline-tool gap, see the [After Effects comparison](/compare/after-effects).) The reason these tools cannot do what git does is that their artifact is not text. The artifact decides what the version control system can do. HyperFrames bet that the right artifact is HTML. The bet pays off here. ## What this changes about hiring and onboarding A side effect I did not predict. When I joined this team, my onboarding took half a day. I cloned the repo. I read the README. I ran `npx hyperframes init`. I read one composition end-to-end. I started shipping that afternoon. Compare to the onboarding for a motion design team. New designer arrives. Someone walks them through the project file conventions. They ask where the brand kit lives. They are told "DM Sarah, she has the latest version." They poke at AE for two days before they ship anything. Three weeks in, they still find files that follow conventions nobody remembers establishing. When the work is in a repo, the work documents itself. The README is the team's wisdom, version-controlled. The compositions are the examples, all readable. The conventions are linted. A new hire reads the codebase the way they would read any codebase, and they ramp at the speed they would ramp on any codebase. This is, structurally, the same thing that happened to web development between 2008 and 2015. Static site generators, npm, git workflows — these did not change *what* web developers shipped, but they changed *who could ship it* and *how fast*. Motion design is having that moment. ## How to start If you are convinced and want to try this on a real project, here is a minimum viable workflow. Create a git repo. Add `compositions/` as a directory. Put one HTML file in it. Write a `Makefile` (or `package.json` scripts) with `render` and `preview` targets. Add a `brand.css` for tokens. Commit. 
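The sidecar manifest mentioned in the rollback section can be very small. The sketch below is one possible shape, not the format HyperFrames actually writes: the `RenderManifest` fields, `buildManifest`, and `manifestPath` are all hypothetical names. It assumes the commit hash is obtained separately, for example from `git rev-parse HEAD`, and passed in as a string.

```typescript
// Hypothetical manifest shape; the real sidecar format is not shown in this post.
interface RenderManifest {
  commit: string;     // full commit hash, e.g. the output of `git rev-parse HEAD`
  source: string;     // path of the composition that was rendered
  output: string;     // path of the MP4 that was written
  renderedAt: string; // ISO-8601 timestamp of the render
}

// Build the manifest from values the render script already has.
function buildManifest(
  commit: string,
  source: string,
  output: string,
  renderedAt: Date
): RenderManifest {
  // Accept 40-char (SHA-1) or 64-char (SHA-256) hex hashes; reject abbreviations.
  if (!/^([0-9a-f]{40}|[0-9a-f]{64})$/.test(commit)) {
    throw new Error("expected a full commit hash, got: " + commit);
  }
  return { commit, source, output, renderedAt: renderedAt.toISOString() };
}

// Place the manifest next to the MP4: out/hero.mp4 -> out/hero.manifest.json
function manifestPath(output: string): string {
  return output.replace(/\.mp4$/, "") + ".manifest.json";
}

// Example with a placeholder hash (not a real commit).
const exampleManifest = buildManifest(
  "a".repeat(40),
  "compositions/hero.html",
  "out/hero.mp4",
  new Date(Date.UTC(2026, 4, 11, 13, 30, 0))
);
const exampleJson = JSON.stringify(exampleManifest, null, 2);
```

Refusing abbreviated hashes is a deliberate choice in this sketch: a short hash can become ambiguous as the repository grows, and the manifest is only useful if it pins exactly one commit.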
From now on, every change to the composition is a commit. Every variant is a branch. Every render writes a sidecar with the commit hash. Every review is a pull request.

This is not new infrastructure. It is the same infrastructure your software team has been using since 2010. The only change is that the artifact under management is now also a motion design. That is the unlock.

Open a terminal. `git init`. Begin.

---

# Generate a podcast thumbnail for every episode (automatically)

URL: https://hyperframes.video/blog/podcast-thumbnail-generator
Published: 2026-05-11T09:00:00.000Z
Tags: podcast, thumbnail, template, automation
Author: hf-team

A podcast that ships weekly needs a fresh cover every week. Most shows ship the same cover for forty episodes because re-doing it in Figma is enough friction to skip. Per-episode covers (with the title, the number, the guest) measurably improve play-through on Spotify and Apple.

The fix is templating. Here is the version where the cover is HTML and the episode metadata is a CSV row.

## What a podcast cover needs

Apple Podcasts accepts square artwork between 1400×1400 and 3000×3000 pixels, RGB, JPG or PNG; render the 3000×3000 master and let each platform downscale. The visual hierarchy that works for thumbnails *and* lock-screen art:

1. **Episode number**: the largest element. It should still read when the cover is shrunk to about 100px on a phone.
2. **Episode title**: the second-largest. Two lines max.
3. **Guest name**: small, on the bottom.
4. **Show name**: top, identifiable but quiet.

The trap most podcasts fall into is making the show name biggest. The listener already knows the show name; they tapped on the show to get here. The episode-specific information is what they need next.
## The template

A single HTML document with eight variable knobs, four of which change per episode:

<VariableKnobs html={`<style>body{margin:0;background:{{$BG}};color:white;font-family:ui-sans-serif,system-ui;aspect-ratio:1/1;width:100%;height:100vh;display:grid;grid-template-rows:auto 1fr auto;padding:12%;}
header{display:flex;justify-content:space-between;font-size:14px;letter-spacing:.3em;text-transform:uppercase;opacity:.7;}
.ep{font-size:160px;font-weight:900;letter-spacing:-.06em;line-height:.85;color:{{$ACCENT}};}
.title{font-size:48px;font-weight:700;line-height:1.05;letter-spacing:-.03em;max-width:80%;}
.guest{margin-top:18px;font-size:22px;opacity:.75;}
footer{font-size:14px;letter-spacing:.25em;text-transform:uppercase;opacity:.5;display:flex;justify-content:space-between;}</style>
<header><span>{{$SHOW}}</span><span>Ep · {{$NUM}}</span></header>
<div><div class="ep">{{$NUM}}</div><div class="title">{{$TITLE}}</div><div class="guest">with {{$GUEST}}</div></div>
<footer><span>{{$DATE}}</span><span>{{$SHOW_URL}}</span></footer>`}
knobs={[
  { name: "SHOW", label: "Show", default: "Frame by Frame" },
  { name: "NUM", label: "Episode #", default: "042" },
  { name: "TITLE", label: "Episode title", default: "The Art of Determinism" },
  { name: "GUEST", label: "Guest", default: "Kira Tanaka" },
  { name: "DATE", label: "Date", default: "May 2026" },
  { name: "SHOW_URL", label: "URL", default: "hyperframes.fm" },
  { name: "ACCENT", label: "Accent", type: "color", default: "#ff3b1f" },
  { name: "BG", label: "Background", type: "color", default: "#0a0a0a" }
]}
height={420}
caption="Eight knobs cover most podcast cover use cases."
/>

## Per-episode rendering

The CSV that drives it:

```csv
num,title,guest,date
042,The Art of Determinism,Kira Tanaka,2026-05-12
043,Why HTML Won,Marcus Okafor,2026-05-19
044,Render Once Run Forever,Ren Park,2026-05-26
```

The render loop, in pseudo-code:

```ts
// Escape CSV values before they land in HTML, so a title containing "<" or "&" cannot break the markup.
const esc = (s: string) =>
  s.replace(/&/g, '&amp;').replace(/</g, '&lt;').replace(/>/g, '&gt;');

// Show-level knobs (SHOW, SHOW_URL, ACCENT, BG) are assumed to be filled once, outside this loop.
for (const row of episodes) {
  // Function replacers keep "$" sequences in the data from being read as replacement patterns.
  const html = template
    .replace(/\{\{\$NUM\}\}/g, () => esc(row.num))
    .replace(/\{\{\$TITLE\}\}/g, () => esc(row.title))
    .replace(/\{\{\$GUEST\}\}/g, () => esc(row.guest))
    .replace(/\{\{\$DATE\}\}/g, () => esc(row.date));
  await renderHtmlToImage(html, { width: 3000, height: 3000, format: 'jpg' });
}
```

Same shape as the [CSV personalized video](/blog/batch-personalized-videos-from-csv) pattern, just rendering to an image instead of an MP4. The HyperFrames render API supports both.

## Multi-platform variants

Most podcast platforms accept the 3000×3000 master. A few want different aspects:

- **Spotify Canvas**: 1080×1920 vertical animated cover (yes, podcasts can have animated covers now). Render the same template at 9:16 with the animation on.
- **YouTube**: 1280×720 thumbnail for the video version. Same template at 16:9.
- **Twitter share card**: 1200×628.

If you build the template with `@container` queries (the same trick from [social cards in every ratio](/blog/social-media-cards-every-ratio)), all four come from one source.

## Things that look obvious in hindsight

After shipping per-episode covers for two seasons of one of our shows, things we wish we had done earlier:

1. **Lock the type scale.** Episode titles vary wildly in length. Bound the title inside a fixed box with `font-size: clamp()` and a two-line cap. `clamp()` limits the size range but does not measure the text, so very long titles may still need a length-based step-down. Otherwise the layout breaks on long titles.
2. **Reserve space for the guest.** Some episodes have no guest. Leave the slot in the layout; render empty. Otherwise the title creeps down on no-guest episodes and the visual rhythm breaks.
3. **Version the template.** Episode 42's cover should render correctly forever. If you change the template, version it; old episodes keep rendering from the old template hash.
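One way to read "version the template" is to key each episode to a content hash of the template it was first rendered with. The sketch below is an illustration under assumptions, not how HyperFrames stores templates: `fnv1a`, `registerTemplate`, `templateFor`, and `EpisodeRecord` are hypothetical names, and FNV-1a stands in for a cryptographic hash such as SHA-256, which is what you would want in production.

```typescript
// FNV-1a (32-bit) over UTF-16 code units. Non-cryptographic; a stand-in only.
function fnv1a(text: string): string {
  let hash = 0x811c9dc5;
  for (let i = 0; i < text.length; i++) {
    hash ^= text.charCodeAt(i);
    hash = Math.imul(hash, 0x01000193); // 32-bit multiply by the FNV prime
  }
  const hex = (hash >>> 0).toString(16);
  return ("00000000" + hex).slice(-8); // left-pad to 8 hex digits
}

// Hypothetical episode record: each episode remembers which template made its cover.
interface EpisodeRecord {
  num: string;
  templateHash: string;
}

// Every template version ever used, keyed by its content hash.
const templates = new Map<string, string>();

// Store a template version and return the hash that identifies it.
function registerTemplate(source: string): string {
  const hash = fnv1a(source);
  templates.set(hash, source);
  return hash;
}

// Look up the exact template an episode was rendered with.
function templateFor(episode: EpisodeRecord): string {
  const source = templates.get(episode.templateHash);
  if (source === undefined) {
    throw new Error("unknown template hash: " + episode.templateHash);
  }
  return source;
}

// Episode 42 is pinned to the first version; a later redesign does not affect it.
const firstVersion = registerTemplate('<p class="v1">{{$TITLE}}</p>');
const episode42: EpisodeRecord = { num: "042", templateHash: firstVersion };
const secondVersion = registerTemplate('<p class="v2">{{$TITLE}}</p>');
const coverSource = templateFor(episode42); // still the first version
```

Keying by content rather than by a hand-maintained version number means an edited template can never silently reuse an old identifier.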
The last one is the determinism lesson generalized to assets. See [the deterministic rendering manifesto](/blog/deterministic-video-manifesto) for why this matters in CI. ## Wiring it into the publishing flow If you publish via Buzzsprout, Transistor, Castos, or any podcast host: most have an API. The flow is: 1. Episode metadata gets written to a CSV (or a Notion DB, or your CMS). 2. CI renders the cover from the template + metadata. 3. Cover uploads to the host alongside the audio. 4. Done. For shows that ship every week, this is fifteen minutes of setup that pays back forever. The cover stops being a craft step; it becomes a build step. ## Why episode-specific covers matter The play-through data, from the shows we have measured: episodes with a per-episode cover get 12-18% more plays than episodes with the show-default cover. The hypothesis is intuitive: a listener scrolling their library sees the *episode* rather than the *show*; the cover acts as a thumbnail. The cost of per-episode covers used to be a designer's afternoon. With a template, the cost is zero per episode after the first. The math is unambiguous. The [marketing use cases](/use-cases/marketing) page has the longer story. The [HyperFrames render API](/tools/html-to-video) is the underlying tool. If you ship a podcast, this is one of the highest-leverage things you can template. --- # Make a YouTube intro from code (and render every episode's) URL: https://hyperframes.video/blog/youtube-intro-from-code Published: 2026-05-10T10:00:00.000Z Tags: youtube, intro, template, tutorial Author: hf-team Most YouTube channels have one intro that has not changed since episode 3. There is a reason: re-doing it in After Effects is a half-day, and "the intro is fine" beats "I am rebuilding the intro." Until the rebrand. Then it is a week. Here is the version where the intro is an HTML template, the episode number is a variable, and re-rendering for a new season takes thirty seconds. 
## What a 5-second intro needs

The list, in order of priority:

1. **Brand mark or wordmark.** The single most important element.
2. **A pacing beat.** Something that lands at ~1.5s so the eye does not get bored.
3. **A tagline or category.** Optional but loved by viewers who skip the first second.
4. **An exit cue.** A small motion at 4-5s that signals "content starts now."

That is it. More than that is too much for a 5-second window.

## The template

The whole thing in 30 lines:

```html
<!doctype html>
<html data-duration="5" data-aspect="16:9">
<head><style>
  body { margin: 0; background: {{$BG}}; height: 100vh; display: grid; place-items: center; color: white; font-family: ui-sans-serif, system-ui; }
  .stage { text-align: center; }
  .bar { width: 0; height: 6px; background: {{$ACCENT}}; margin: 24px auto; border-radius: 4px; }
  .chan { font-size: 132px; font-weight: 900; letter-spacing: -.05em; opacity: 0; transform: translateY(20px); }
  .tag { font-size: 18px; letter-spacing: .5em; text-transform: uppercase; margin-top: 18px; opacity: 0; }
</style></head>
<body>
  <div class="stage">
    <div class="bar" id="bar1"></div>
    <div class="chan" id="chan">{{$CHANNEL}}</div>
    <div class="bar" id="bar2"></div>
    <div class="tag" id="tag">{{$TAGLINE}}</div>
  </div>
  <script>
    function ease(t){ return 1 - Math.pow(1-t, 3); }
    function clamp(x){ return Math.max(0, Math.min(1, x)); }
    function render(t) {
      document.getElementById('bar1').style.width = (ease(clamp((t-0.1)/0.6)) * 60) + '%';
      var c = ease(clamp((t-0.8)/0.8));
      document.getElementById('chan').style.opacity = c;
      document.getElementById('chan').style.transform = 'translateY(' + (1-c)*20 + 'px)';
      document.getElementById('bar2').style.width = (ease(clamp((t-1.6)/0.5)) * 30) + '%';
      document.getElementById('tag').style.opacity = ease(clamp((t-2.0)/0.6));
    }
    addEventListener('hf-seek', e => render(e.detail.time));
    render(0);
  </script>
</body></html>
```

Four variables: `BG`, `ACCENT`, `CHANNEL`, `TAGLINE`.
Everything else is a design decision frozen in the template. ## Make it yours <VariableKnobs html={`<style>body{margin:0;background:{{$BG}};height:100vh;display:grid;place-items:center;color:white;font-family:ui-sans-serif,system-ui;} .s{text-align:center;} .b{width:60%;height:4px;background:{{$ACCENT}};margin:18px auto;border-radius:4px;} .c{font-size:84px;font-weight:900;letter-spacing:-.05em;} .t{font-size:14px;letter-spacing:.5em;text-transform:uppercase;margin-top:14px;opacity:.7;}</style> <div class="s"><div class="b"></div><div class="c">{{$CHANNEL}}</div><div class="b" style="width:30%;"></div><div class="t">{{$TAGLINE}}</div></div>`} knobs={[ { name: "CHANNEL", label: "Channel", default: "HyperFrames" }, { name: "TAGLINE", label: "Tagline", default: "videos that render themselves" }, { name: "ACCENT", label: "Accent", type: "color", default: "#ff3b1f" }, { name: "BG", label: "Background", type: "color", default: "#0a0a0a" } ]} /> ## Per-episode variants The whole point of doing this in code: every episode can have its *own* intro. Episode-specific knobs to consider: - **Episode number** overlaid in the corner (small, monospaced). - **Season tagline** replacing the static one. - **Color shift** for thematic episodes (a darker accent for the finale). A weekly show with 52 episodes gets 52 unique intros for the cost of running the render 52 times. The first one took an hour; the next 51 take twelve minutes total. ## Render specs that matter for YouTube YouTube's encoder is forgiving but punishes a few specific choices: - **Resolution.** 1920×1080, even if you plan a 4K master. The intro will be re-encoded; you do not need the extra detail. - **FPS.** 30 or 60. Avoid 25/24 unless your full content runs at those rates — the mismatch causes a stutter at the intro/content boundary. - **Bitrate.** Render at 8-12 Mbps. YouTube will re-encode, but if the source is too low, the re-encode amplifies artifacts. 
- **No audio.** Keep the intro silent or pair it with a single short stinger. The viewer's audio sources for the rest of the video will not match a busy intro track. ## Wire it into your publishing flow If you publish on a schedule, wire this into your CI. A [GitHub Actions matrix](/integrations/github-actions) renders the intro for each episode in your queue. Outputs land in S3; your publishing tool grabs them. The intro is now a build artifact, not a craft asset. ## The variant economy The interesting thing about code-driven intros is not that the first one is cheaper — it is not. The interesting thing is that the second through hundredth are free. Most channels operate under "we can afford one intro." Once you can afford a hundred, the editorial choices change. The [marketing use cases](/use-cases/marketing) page has examples from teams that ship per-episode intros at scale. The [HyperFrames render API](/tools/html-to-video) is the underlying infrastructure. The pattern, repeated until it stops being novel: video template + variable input = build artifact. Same idea as the static-site generator did for the web. We are running the same play, for video. --- # Instagram Reels automation: render 50 reels from one HTML template URL: https://hyperframes.video/blog/instagram-reels-automation Published: 2026-05-10T09:00:00.000Z Tags: instagram, reels, automation, 9:16, tutorial Author: ren-park Instagram's algorithm rewards volume, but the production tools assume one editor per video. The math doesn't work past ten reels a week. The way out is to template the format — once — and render variants from a spreadsheet. This guide walks the engineering side of a Reels program: the 9:16 canvas, the safe zones Instagram covers, the audio-sync constraints, and the batch render setup that ships fifty videos overnight. ## The Reels canvas Instagram Reels is `1080 × 1920` at 30fps. 
The platform overlay budget is slightly more generous than TikTok but follows the same shape: - **Bottom 220px**: caption + username + audio attribution. - **Right 160px, lower-third area**: like / comment / share / "use audio" rail. - **Top 90px**: status bar in the iOS app, less critical but visible. Design every shot to clear those zones. The viewer-actionable safe area is roughly 800px wide × 1500px tall, centered, biased slightly upward. <InlineSandbox html={`<!doctype html> <html><body style="margin:0;background:#0a0a0a;display:grid;place-items:center;min-height:100vh;font-family:ui-sans-serif,system-ui;"> <div style="position:relative;width:270px;height:480px;background:radial-gradient(at 30% 20%,#ff3b1f,transparent 50%),radial-gradient(at 70% 80%,#7b2cff,transparent 50%),#0a0a0a;border-radius:24px;overflow:hidden;color:#fff;"> <div style="position:absolute;inset:24px 14px 64px 14px;outline:2px dashed rgba(255,255,255,.4);border-radius:14px;display:grid;place-content:center;text-align:center;"> <div style="font-weight:800;font-size:28px;letter-spacing:-.02em;line-height:1.05;">SAFE ZONE</div> <div style="font-size:10px;letter-spacing:.2em;text-transform:uppercase;opacity:.6;margin-top:6px;">800 × 1500</div> </div> <div style="position:absolute;right:6px;bottom:74px;display:grid;gap:12px;"> <div style="width:24px;height:24px;border-radius:50%;background:rgba(255,255,255,.18);"></div> <div style="width:24px;height:24px;border-radius:50%;background:rgba(255,255,255,.18);"></div> <div style="width:24px;height:24px;border-radius:50%;background:rgba(255,255,255,.18);"></div> </div> <div style="position:absolute;left:14px;right:44px;bottom:10px;font-size:10px;line-height:1.3;"> <div style="font-weight:700;">brand.studio · Original audio</div> <div style="opacity:.85;margin-top:2px;">Caption with a thought. Maybe a hashtag. #reels</div> </div> </div> </body></html>`} height={520} caption="Reels canvas — the dashed region is what survives Instagram's UI overlays." 
/>

## The four-shot structure

Reels under thirty seconds get the highest reach. A reliable four-shot template:

| Shot | Duration | Job |
|---|---|---|
| Hook | 0.0–1.5s | One sentence, big type, single visual idea |
| Setup | 1.5–6s | Context: why does the hook matter |
| Payoff | 6–18s | The actual content (data, demo, quote) |
| CTA | 18–24s | Handle + link in bio + a single image |

Each shot is its own scene in the HTML; the timeline is just `data-start` and `data-end` attributes plus CSS transitions. No timeline software.

## Audio: the one thing you cannot deterministically render

Audio is the asterisk on "render everything from code." Reels lives or dies by audio, and you cannot embed Instagram's licensed library into your render, because the platform requires audio be added in-app to track the licensing.

Two workable patterns:

1. **Render silent MP4, add audio in the Instagram app** at upload time. Slow, but lets you use the algorithmic-favorite Instagram-library tracks.
2. **Render with your own licensed audio embedded.** Faster, but the algorithm slightly under-distributes videos with non-library audio.

We default to pattern 1 for the first version of a campaign and switch to pattern 2 once a sound shows traction.
## The variable surface Five variables, no more: - **Hook text** (one line) - **Body content** (text or asset URL) - **CTA line** - **Color theme** (one accent color drives everything) - **Duration override** (default 24s; some hooks need 8s) <VariableKnobs html={`<style>body{margin:0;background:#000;height:480px;display:grid;place-items:center;font-family:ui-sans-serif,system-ui;} .r{position:relative;width:270px;height:480px;background:radial-gradient(at 30% 20%,{{$ACCENT}}aa,transparent 50%),radial-gradient(at 70% 80%,{{$ACCENT}}55,transparent 60%),#0a0a0a;border-radius:24px;overflow:hidden;color:#fff;} .hk{position:absolute;left:18px;right:18px;top:80px;font-weight:800;font-size:30px;letter-spacing:-.02em;line-height:1.05;} .bd{position:absolute;left:18px;right:18px;top:240px;font-size:14px;line-height:1.5;opacity:.92;} .cta{position:absolute;left:14px;right:14px;bottom:74px;background:rgba(255,255,255,.08);backdrop-filter:blur(10px);border:1px solid rgba(255,255,255,.15);border-radius:14px;padding:10px 14px;display:flex;justify-content:space-between;align-items:center;} .cta b{font-size:13px;font-weight:700;} .cta i{color:{{$ACCENT}};font-style:normal;font-size:13px;font-weight:600;} </style> <div class="r"> <div class="hk">{{$HOOK}}</div> <div class="bd">{{$BODY}}</div> <div class="cta"><b>@{{$HANDLE}}</b><i>{{$CTA}} →</i></div> </div>`} knobs={[ { name: "HOOK", label: "Hook", default: "We replaced our scheduler with 12 lines of cron." }, { name: "BODY", label: "Body", default: "Three months later: zero incidents, no on-call." }, { name: "HANDLE", label: "Handle", default: "ourstartup" }, { name: "CTA", label: "CTA", default: "Read the post" }, { name: "ACCENT", label: "Accent", type: "color", default: "#ff3b1f" } ]} height={520} /> ## Batch pipeline Once the template is solid, the throughput math is mechanical: 1. CSV with one row per reel. 2. Render queue produces a `1080×1920` MP4 per row. 3. Thumbnails generate for review. 4. 
Approved videos sync to a Buffer / Later / Hootsuite uploader. [See the full CSV pipeline](/blog/batch-personalized-videos-from-csv) for the orchestration code. ## Throughput math A team running this pipeline (one creative, one engineer, weekly batch) ships: Most of which never get used. That is the point — at this volume, the marketer picks the top 20% and discards the rest, which is the inverse of the "one editor per video" workflow where every video is precious by virtue of how long it took. ## The other formats come free Once a Reel exists as HTML, the same template at `1080×1080` is an in-feed post, at `1080×1350` is a portrait post, at `1920×1080` is a Facebook-cross-post hero. Five formats from one source. [Open the playground](/playground), build the first template, queue the batch. --- # Turn any SVG animation into a real MP4 URL: https://hyperframes.video/blog/svg-animation-to-mp4 Published: 2026-05-09T16:00:00.000Z Tags: svg, mp4, animation, tutorial Author: ren-park SVG is the best animation format on the web that almost nobody uses as a *source* format for video. You can author once in SVG, render to MP4 for social, render to GIF for email, render to a PNG sequence for an editor. The asset is reproducible, version-controlled, and small. Here is how to render the three main flavors of animated SVG — SMIL, CSS, and JS-driven — to a deterministic MP4. ## The three kinds of SVG animation Almost every animated SVG in the wild is one of these: 1. **SMIL** — the legacy `<animate>` and `<animateTransform>` tags. Native, no JS, declarative. 2. **CSS-animated SVG** — SVG elements with CSS `animation` or `transition` applied. Standard web animation, applied to SVG. 3. **JS-driven SVG** — JS reads the clock and mutates SVG attributes per frame. Maximum control. All three can be rendered to MP4. The trick is each one needs a slightly different bridge to make the renderer scrub deterministically. ## Bridge 1: SMIL SMIL animations run on their own internal clock. 
To make a SMIL SVG renderable, you need to expose a "seek to time" hook. The standard pattern:

```html
<svg>
  <circle r="20" cy="100">
    <animate attributeName="cx" from="0" to="800" dur="4s" repeatCount="indefinite" />
  </circle>
  <script>
    // Target the <svg> element itself: when the SVG is inlined in an HTML page,
    // document.documentElement is <html>, which has no setCurrentTime().
    const svg = document.querySelector('svg');
    svg.pauseAnimations(); // stop the SMIL clock so only seeks move it
    addEventListener('hf-seek', e => {
      svg.setCurrentTime(e.detail.time);
    });
  </script>
</svg>
```

`setCurrentTime` is the SVG-native way to scrub, and `pauseAnimations` keeps the internal clock from advancing between seeks. The HyperFrames renderer dispatches `hf-seek` events; this small bridge wires them up.

## Bridge 2: CSS-animated SVG

CSS animations on SVG elements have the same problem as CSS animations on HTML: they want to run on a wall clock you do not control. The fix is the same: pause the animation and set `animation-delay` to a negative value per frame.

```javascript
addEventListener('hf-seek', e => {
  document.querySelectorAll('[data-anim]').forEach(el => {
    el.style.animationPlayState = 'paused';
    el.style.animationDelay = `-${e.detail.time}s`;
  });
});
```

This works for any CSS animation, not just SVG. The negative delay scrubs the animation to the requested time and the paused state freezes it there.

## Bridge 3: JS-driven SVG

JS-driven is the easiest because you already have a clock you control. Just call your `render(t)` function with the seek time:

```javascript
const circle = document.querySelector('circle');

function render(t) {
  circle.setAttribute('cx', t * 200);
}
addEventListener('hf-seek', e => render(e.detail.time));
render(0);
```

Pure function of time, no internal state. This is the same pattern we use for [particle effects](/blog/css-confetti-particle-effect) and [data viz](/blog/animated-bar-chart-tutorial), and it is the only one of the three that gives byte-identical renders across machines.
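To make "pure function of time" concrete, here is a small sketch of the idea, using a 4-second ease-out loop as the motion. The names `cxAt` and `frameTime` are illustrative, not part of any HyperFrames API. The point it tries to show: each frame's time is computed from its frame index, never accumulated from a `dt`, so frames can be evaluated in any order and still agree.

```typescript
// Circle x-position as a pure function of time in seconds (4-second loop, ease-out cubic).
function cxAt(t: number): number {
  const u = (t % 4) / 4;       // progress through the loop, 0..1
  const inv = 1 - u;
  const eased = 1 - inv * inv * inv;
  return 80 + eased * 640;     // travel from x=80 to x=720
}

// Frame time derived from the frame index, so there is no accumulated drift.
function frameTime(frame: number, fps: number): number {
  return frame / fps;
}

// Evaluate a few frames of a 30fps render: t = 0s, 1s, 2s, 3s.
const fps = 30;
const sampledPositions = [0, 30, 60, 90].map((frame) => cxAt(frameTime(frame, fps)));

// Scrubbing backwards gives the same answers, because nothing is remembered between calls.
const scrubbedBackwards = [90, 60, 30, 0].map((frame) => cxAt(frameTime(frame, fps))).reverse();
```

A version that kept a running `cx += speed * dt` would need every earlier frame to have run first, which is exactly the internal state a seek-based renderer cannot rely on.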
<InlineSandbox height={300} html={`<svg viewBox="0 0 800 300" xmlns="http://www.w3.org/2000/svg" style="background:#0a0a0a;width:100%;height:100%;display:block;"> <circle id="c" cx="0" cy="150" r="40" fill="#ff3b1f" /> <text id="t" x="40" y="290" fill="#999" font-family="ui-monospace,monospace" font-size="12">t = 0.00s</text> </svg> <script> function render(t) { var u = (t % 4) / 4; var ease = 1 - Math.pow(1 - u, 3); document.getElementById('c').setAttribute('cx', 80 + ease * 640); document.getElementById('t').textContent = 't = ' + t.toFixed(2) + 's'; } addEventListener('hf-seek', function(e) { render(e.detail.time); }); render(0); </script>`} caption="A pure-function-of-t SVG animation. Scrub anywhere, it renders correctly." /> ## Paste your own SVG If you have an existing SVG animation, the conversion is usually: 1. Find the animation driver (SMIL, CSS, JS). 2. Add the appropriate bridge above. 3. Test by scrubbing — does the SVG look right at t=0, t=2, t=4? If scrubbing looks broken, it is almost always because the animation has internal state (a `setInterval`, an accumulator). Replace those with pure functions of `t`. See our [DOM-to-MP4 walkthrough](/blog/from-dom-to-mp4) for the longer version of this story. ## The render Once the SVG scrubs cleanly, render it. Open the [HyperFrames playground](/playground), paste the SVG (wrap it in `<!doctype html><html><body>...`), set duration, render. You get an MP4. ## Why MP4 and not GIF GIF is 256 colors, no audio, and 30-50× the file size of equivalent MP4. The only case for GIF is "the platform does not allow MP4" — usually email. For everything else, MP4 wins on quality and size. We covered the [GIF-vs-MP4 tradeoff in detail](/blog/gif-to-mp4-the-right-way) if you want the numbers. ## The reproducibility win The reason this matters: an SVG you rendered to MP4 last quarter, with the same template and same data, renders to the *same bytes* today. That is not true of any timeline-tool export. 
Determinism is the feature that compounds — every render after the first is a build, not a craft. SVG-as-source is one of the cleanest ways to get there. --- # Replacing After Effects with a text editor URL: https://hyperframes.video/blog/replacing-after-effects-with-a-text-editor Published: 2026-05-09T14:00:00.000Z Tags: workflow, design, after-effects, comparison Author: marcus-okafor I learned After Effects in 2017. I spent the next eight years there. I shipped network IDs, election graphics, app launch films, twenty-five-second ads that ran on television, a four-minute explainer for a Series B that ended up on the homepage of Hacker News. AE was my native language, and I was, by most measures, fluent. A year ago I switched. The composition I am writing right now — a 30-second product reveal for a fintech client — is open in VS Code, not AE. The next composition, due Friday, will be in VS Code too. I am not coming back. I want to tell you what that transition was like, because it is the most concrete answer I can give to the question "is HyperFrames really a replacement for After Effects, or is it for a different audience?" The short answer is: it is for the same audience, doing the same job, with different ergonomics. The longer answer is below — and if you want the feature-by-feature breakdown, the [HyperFrames vs After Effects comparison](/compare/after-effects) has it. ## What I gained The first thing I gained is *diff*. I had not realized how much of my professional anxiety in AE was generated by the fear of losing work. AE projects accumulate. You have v3, v3-final, v3-final-CLIENTREVIEW, v3-final-CLIENTREVIEW-marcus-edit, v4. The "what changed since the last review" question is, at best, answerable by opening both files side by side and squinting. At worst, it is unanswerable. You have lost the audit trail of your own thinking. When the composition is HTML, the diff is the diff. Every edit is a line. Every revision is a commit. 
Every client comment becomes a branch. When the client says "go back to the version where the title was bigger," I check out the commit from three days ago. The whole concept of "saving over the file" is gone. The second thing I gained is *reusability that actually works*. AE has expressions. AE has scripts. AE has Essential Graphics. I used all of them. None of them ever felt like real reuse. The pre-comp I built for one project never quite plugged into the next project. The expression I wrote for one text layer was a copy-paste away from the next text layer, with subtle differences I had to remember. In HTML, a lower third is a `<div>` with classes. I reuse the same `<div>` across thirty compositions. When the brand color updates, I change one CSS variable. When the type system updates, I change one font import. The whole notion of a design system becomes practical, because every composition is built from the same primitives. The third thing I gained is *agents that work*. I am not exaggerating when I say this changed my month-to-month output. When I need a routine title card — name, role, transition — I describe it to Claude in one sentence, and it produces the HTML in seconds. I review the output, tweak one thing, render. The bottleneck used to be the routine work; now the routine work is something I delegate. I spend my time on the parts that take taste. ## What I lost I want to be honest about this part, because I have read too many "I switched and never looked back" essays where the writer is performatively enthusiastic. I lost real things. I lost *the timeline*. AE's timeline view, where you can see every layer with its keyframes laid out horizontally, is genuinely a great UI for thinking in time. HTML does not have it. I have something resembling it — HyperFrames' preview pane shows a scrubber, and I can scrub through a composition — but I cannot see the structure of every animation at a glance the way I could in AE. The timeline is a loss. 
I lost *direct manipulation of bezier handles*. In AE, when I do not like an ease, I grab the handle and pull. In HTML, I write `cubic-bezier(.16, 1, .3, 1)` and check the result. The feedback loop is slower. I keep a web tool (cubic-bezier.com) open to tune curves interactively, then paste the values into CSS. It is fine. It is not as fast as dragging the handle.

I lost *the visual asset browser*. AE's project panel, with thumbnails of every clip and image, was a way of *seeing* my materials. In HTML, my materials are file paths. I keep a separate folder open in Finder and refer to it. This is not catastrophic, but it is friction.

I lost *some advanced effects*. There are things AE does that the browser does not, or does badly. True 3D camera (not CSS perspective, real 3D camera with depth-of-field). Particle systems with millions of points. Optical flow time remapping. Plug-in ecosystems like Trapcode. For these I still occasionally open AE and render a clip to MP4, which I then composite into my HyperFrames timeline. The escape hatch matters.

## What surprised me

A few things I did not expect.

I expected typography to be a downgrade. It is not. CSS has variable fonts, OpenType features, optical sizing, color fonts, and a level of typographic control that AE has never matched. The serif headlines I am setting in HyperFrames look better than what I shipped in AE, because CSS handles type properly and AE handles type adequately. The first time I rendered a small-caps heading with stylistic alternates in a single line of CSS, I knew I was not going back.

I expected rendering to be slower. It is faster. A 30-second 1080p60 composition that took 4 minutes to render in AE (with full quality, real shadows, all the trimmings) renders in 12 seconds in HyperFrames. The reason is that the browser is rendering frames at 60fps live anyway; capturing them to disk is mostly the cost of PNG encoding.
AE is doing far more work per frame because AE supports far more per-frame complexity. For the work I do — editorial motion, charts, type — the browser is more than enough, and the speed difference is enormous. I expected to miss the community. AE has decades of tutorials, presets, templates, asset packs. HyperFrames has a year. But here is what I underestimated: the broader web design community is the HyperFrames community. Every CSS animation tutorial on the internet is a HyperFrames tutorial. Every typography pattern from Practical Typography or Butterick is a HyperFrames pattern. The asset library is the entire web. ## A typical day, before and after Let me describe a real workflow shift, because abstractions are useless. **Before.** A client asks for a 15-second product reveal. I open AE. I import the brand kit (a project file someone else built two years ago, with conventions I do not entirely remember). I set up a new composition, 1920x1080, 30fps. I import the product render (a 4K PNG sequence the 3D team gave me). I build the title sequence — text layer with a "Pop-Up" preset, modified. I tweak the easing. I render to a temp file. I send it to the client. They ask for the title to be bigger. I move the slider. I re-render. I send. They ask for a different color. I move the color picker. I re-render. I send. By end of day, I have five rounds of revisions, five exports, five DM links, and a sense of mild dread about which version is the canonical one. **After.** Same brief. I open VS Code. I open the brand `brand.css` (a single file, version-controlled). I copy `title-card.html` from a sibling project — same brand, different copy. I change the copy. I open the preview pane (HyperFrames' `npx hyperframes preview` runs on `:3000`). I see the result live. The client is on a call; I share my screen. They say make the title bigger. I change `font-size: 96px` to `font-size: 112px`. They see it instantly. They say change the color. I change one variable. Instantly. 
I commit each change with a message describing what they asked for. I render once, at the end, when we are aligned. The final MP4 has a sidecar manifest pointing to the exact commit that produced it. The day-to-day texture is different. I am no longer waiting for renders. I am no longer in an asynchronous tennis match of "send v3, get notes, render v4." The client and I move together. ## What it took to switch I want to be honest about the cost of switching, because it was nonzero. The first two weeks were rough. I knew HTML and CSS — I had built websites — but I did not know motion in CSS at the level I knew motion in AE. I had to learn the idioms: `animation-delay` as a clock, character splitting, the cubic-bezier curves that map to the AE easing presets I had in muscle memory. Reading the [motion graphics in 80 lines](/blog/motion-graphics-in-80-lines) post would have saved me a week. The next month was the unlearning. AE had patterns that did not translate. "Pre-comp this and animate the pre-comp" became "wrap in a div and animate the div." "Use a track matte" became "use `mask-image`." "Use the puppet pin tool" became... not possible, mostly. I had to retire effects I had built my career on. By month three I was as productive as I had been in AE. By month six I was more productive. By month nine the agent loop kicked in and I started shipping volume I could not have produced in AE with twice the hours. ## Should you switch? Here is my honest matrix. **Switch if:** Your work is editorial, type-heavy, brand-driven, repetitive across many variants, or needs to plug into developer workflows (CI, version control, agents). You will be faster, your output will be more consistent, and your files will outlive your employment. **Do not switch if:** Your work is heavy 3D, particle systems, advanced compositing, or anything that depends on a specific AE plug-in. The browser is not a replacement for Cinema 4D plus AE plus Mocha. **Hedge if:** Your work is mixed. 
Use HyperFrames for the editorial layer (titles, lower thirds, charts, transitions) and AE for the effects work. Composite the AE outputs as MP4 clips inside your HyperFrames timeline. This is what I do for the rare project that needs both. I do not think After Effects is going away. It is the right tool for a specific job, and that job is still being done. What I do think is that the field of "motion graphics" — which has lived inside AE for thirty years — is splitting. The editorial, brand-driven, agent-touchable half is moving to text. The cinematic, effects-heavy, manually-tuned half is staying. Both halves will be larger, faster, and more confident in five years than they are today. The text editor is the new timeline. It looks different. It works better than you would expect. Open a new file — or try the [in-browser playground](/playground) without installing anything. See for yourself. If you came up through declarative-canvas tools, the [Motion Canvas comparison](/compare/motion-canvas) is worth a read too. --- # Animate a stock chart with real data URL: https://hyperframes.video/blog/stock-chart-animation Published: 2026-05-08T16:00:00.000Z Tags: stock-chart, data-viz, animation, tutorial Author: kira-tanaka An animated stock chart is one of those things that looks intimidating until you build one. The data is a 1-D array of prices. The chart is an SVG `<polyline>`. The animation is `stroke-dashoffset`. There are four lines of math. Here is the version, with a real-data shape and the windowing trick that keeps long time-series readable. ## The data Flat JSON, one point per time-step: ```json { "ticker": "EXAMPLE", "points": [ { "t": "2026-04-01", "p": 142.30 }, { "t": "2026-04-02", "p": 144.15 }, { "t": "2026-04-03", "p": 140.95 } // ... 60 more ] } ``` Sort by `t` ascending. Compute `min` and `max` for the y-axis. The rest is layout. 
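The sort-and-bounds step just described could be sketched as below. `prepareSeries` is a hypothetical helper name, not part of any HyperFrames API, and the sketch assumes `t` is always an ISO `YYYY-MM-DD` string, so that plain string comparison sorts chronologically.

```javascript
// Hypothetical helper: sorts points by date and finds the y-axis bounds.
// Assumes each point looks like { t: 'YYYY-MM-DD', p: number }.
function prepareSeries(data) {
  // Copy before sorting so the caller's array is left untouched.
  const points = [...data.points].sort((a, b) =>
    a.t < b.t ? -1 : a.t > b.t ? 1 : 0
  );
  let min = Infinity;
  let max = -Infinity;
  for (const { p } of points) {
    if (p < min) min = p;
    if (p > max) max = p;
  }
  return { points, min, max };
}

const series = prepareSeries({
  ticker: 'EXAMPLE',
  points: [
    { t: '2026-04-03', p: 140.95 },
    { t: '2026-04-01', p: 142.30 },
    { t: '2026-04-02', p: 144.15 },
  ],
});
console.log(series.points.map((d) => d.t), series.min, series.max);
```

A plain loop is used for the bounds because spreading a very large array into `Math.min(...)` can exceed the engine's argument limit; for a few hundred points either form works.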
## The SVG ```html <svg viewBox="0 0 800 300" preserveAspectRatio="none"> <polyline id="line" fill="none" stroke="#ff3b1f" stroke-width="2" points="0,250 50,220 100,240 150,180 200,160 ..." /> </svg> ``` Compute the `points` from data: ```js function build(points, w, h) { const ps = points.map(d => d.p); const lo = Math.min(...ps), hi = Math.max(...ps), range = hi - lo; return points.map((d, i) => { const x = (i / (points.length - 1)) * w; const y = h - ((d.p - lo) / range) * h; return `${x},${y}`; }).join(' '); } ``` That is the whole chart. Everything else is decoration. ## The reveal animation The classic SVG trick: animate `stroke-dasharray` and `stroke-dashoffset` so the line draws itself left to right. ```js const line = document.getElementById('line'); const len = line.getTotalLength(); line.style.strokeDasharray = len; line.style.strokeDashoffset = len; line.style.transition = 'stroke-dashoffset 2s cubic-bezier(.16, 1, .3, 1)'; requestAnimationFrame(() => { line.style.strokeDashoffset = 0; }); ``` `getTotalLength()` returns the polyline's path length in user units. Setting `stroke-dasharray` to that length and offsetting by the same amount hides the entire line. Animating the offset to 0 reveals it. For [deterministic rendering](/blog/deterministic-video-rendering-ci), drive the dashoffset as a function of `t` instead of with a CSS transition: ```js addEventListener('hf-seek', e => { const u = Math.min(1, e.detail.time / 2); const ease = 1 - Math.pow(1 - u, 3); line.style.strokeDashoffset = len * (1 - ease); }); ``` ## The windowing trick The hardest part of long time-series: 252 trading days at 4px each is over 1000px wide. On a 16:9 canvas, the early data points are unreadable. The fix: window the data. Show 20-40 points at a time. As time advances, the window slides right; old points scroll off-screen. 
```js function renderWindow(allPoints, t, windowSize, totalDuration) { const progress = t / totalDuration; const start = Math.floor(progress * (allPoints.length - windowSize)); return allPoints.slice(start, start + windowSize); } ``` The eye reads this as a *zooming through time* chart. Much more legible than the static-frame version. ## The endpoint flash A small detail that elevates the chart: a pulsing dot on the most recent data point. The eye is drawn to motion; the pulse says "this is where you are." ```html <circle id="head" r="6" fill="#ff3b1f"> <animate attributeName="r" values="6;10;6" dur="1.5s" repeatCount="indefinite" /> <animate attributeName="opacity" values="1;0.4;1" dur="1.5s" repeatCount="indefinite" /> </circle> ``` Update `cx` and `cy` on every frame to track the last drawn point. The pulse is purely decorative — it carries no data — but it gives the chart life. ## The complete picture <CodeTabs tabs={[ { label: "data.json", code: `{ "ticker": "EXAMPLE", "points": [ { "t": "2026-04-01", "p": 142.30 }, { "t": "2026-04-02", "p": 144.15 }, { "t": "2026-04-03", "p": 140.95 }, { "t": "2026-04-04", "p": 146.20 }, { "t": "2026-04-05", "p": 151.05 }, { "t": "2026-04-08", "p": 149.80 }, { "t": "2026-04-09", "p": 153.40 }, { "t": "2026-04-10", "p": 158.25 } ] }` }, { label: "template.html", code: `<svg viewBox="0 0 800 300" preserveAspectRatio="none"> <polyline id="line" fill="none" stroke="#ff3b1f" stroke-width="2.5" /> <circle id="head" r="6" fill="#ff3b1f" /> </svg> <script> const data = window.__DATA__; const w = 800, h = 300; const ps = data.points.map(d => d.p); const lo = Math.min(...ps), hi = Math.max(...ps), range = hi - lo; const pts = data.points.map((d, i) => { const x = (i / (data.points.length - 1)) * w; const y = h - ((d.p - lo) / range) * h; return [x, y]; }); const line = document.getElementById('line'); line.setAttribute('points', pts.map(p => p.join(',')).join(' ')); const len = line.getTotalLength(); line.style.strokeDasharray = len; 
const head = document.getElementById('head'); addEventListener('hf-seek', e => { const u = Math.min(1, e.detail.time / 3); const ease = 1 - Math.pow(1 - u, 3); line.style.strokeDashoffset = len * (1 - ease); const i = Math.min(pts.length - 1, Math.floor(u * (pts.length - 1))); head.setAttribute('cx', pts[i][0]); head.setAttribute('cy', pts[i][1]); head.style.opacity = u > 0.05 ? 1 : 0; }); </script>` }, { label: "Result", html: `<style>body{margin:0;background:#0a0a0a;color:white;font-family:ui-sans-serif,system-ui;display:grid;grid-template-rows:auto 1fr;padding:32px;height:100vh;}.h{display:flex;justify-content:space-between;align-items:baseline;}.t{font-size:48px;font-weight:800;letter-spacing:-.03em;}.p{font-size:36px;color:#1f8a5b;font-variant-numeric:tabular-nums;}svg{width:100%;height:100%;}</style> <div class="h"><div class="t">EXAMPLE</div><div class="p">$158.25 <span style="font-size:18px;">+11.2%</span></div></div> <svg viewBox="0 0 800 300" preserveAspectRatio="none"> <polyline fill="none" stroke="#ff3b1f" stroke-width="2.5" points="0,250 100,200 200,275 300,150 400,80 500,120 600,40 700,15 800,0" /> <circle r="6" fill="#ff3b1f" cx="800" cy="0" /> </svg>` } ]} caption="Data, template, and rendered result. Same three artifacts every chart needs." /> ## When to render to MP4 A chart as a static image is useful. A chart as an MP4 is *more* useful in three specific cases: 1. **Social media** — a 6-second animated price chart gets dramatically more attention than a screenshot. 2. **Earnings calls** — embedded in slides, the chart draws itself in front of the audience. 3. **Newsletters** — modern email clients support MP4 inline; an animated chart in a financial newsletter reads as broadcast-quality. For all three, the [render pipeline](/integrations/nextjs) takes the JSON + template and returns an MP4. See [the live bar chart tutorial](/blog/animated-bar-chart-tutorial) for the simpler data shape and pattern. 
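One refinement to the template in "The complete picture": the stroke is revealed by eased length, while the head dot jumps between data points using the linear progress `u`, so the dot can sit away from the visible tip of the line. A minimal sketch of one way to keep them together is to place the dot at the same fraction of the polyline's length. `pointAtFraction` is a hypothetical helper, and the sketch assumes `pts` is the `[x, y]` array the template already builds.

```javascript
// Returns the [x, y] position a given fraction (0..1) of the way along a
// polyline, measured by length. pts is an array of [x, y] pairs.
function pointAtFraction(pts, fraction) {
  const f = Math.min(1, Math.max(0, fraction));
  // Measure every segment once and keep the running total.
  const segs = [];
  let total = 0;
  for (let i = 1; i < pts.length; i++) {
    const len = Math.hypot(pts[i][0] - pts[i - 1][0], pts[i][1] - pts[i - 1][1]);
    segs.push(len);
    total += len;
  }
  if (total === 0) return pts[0];
  // Walk the segments until the target distance falls inside one of them.
  let remaining = f * total;
  for (let i = 0; i < segs.length; i++) {
    if (remaining <= segs[i]) {
      const k = segs[i] === 0 ? 0 : remaining / segs[i];
      return [
        pts[i][0] + (pts[i + 1][0] - pts[i][0]) * k,
        pts[i][1] + (pts[i + 1][1] - pts[i][1]) * k,
      ];
    }
    remaining -= segs[i];
  }
  return pts[pts.length - 1];
}

const demo = [[0, 0], [10, 0], [10, 10]];
console.log(pointAtFraction(demo, 0.75));
```

Inside the `hf-seek` handler, the call would be `pointAtFraction(pts, ease)`, with the result written to `cx` and `cy`. In a browser, `line.getPointAtLength(len * ease)` should give an equivalent position without the helper; the pure function is shown because it can be tested outside the DOM.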
## The version with real data

If you wire this to a real data feed:

- **Polygon, Alpaca, or Finnhub** for equities. Free tiers cover ~5 calls/minute, enough for daily renders.
- **CoinGecko** for crypto. Generous free tier.
- **Your own database** if the data is internal (revenue, user counts, etc.).

The render is the cheap part. The data layer is where most of the time goes. Once you have a reliable feed, the chart is a build artifact: same input, same output, every time. See [the developer overview](/developers) for the full SDK surface.

A stock chart with motion is the kind of widget that used to be reserved for Bloomberg terminals. Now it is forty lines of SVG and a JSON file. The asymmetry is the entire point.

---

# Render an MP4 from a Next.js API route (real example)

URL: https://hyperframes.video/blog/render-video-from-nextjs-route
Published: 2026-05-08T13:00:00.000Z
Tags: nextjs, video, api, tutorial
Author: kira-tanaka

The pattern: a Next.js API route accepts a JSON payload, renders an HTML template, encodes it to MP4, and streams the result back. The end-user gets a personalized video; the team gets a single deployment artifact.

Here is the real route I run in production, the template it renders, and the four edge cases that cost us a week of debugging.

## The route handler

A Route Handler in Next.js App Router. Lives at `app/api/render/route.ts`. Takes a POST, returns a `video/mp4` response.

```ts
// app/api/render/route.ts
import { NextRequest, NextResponse } from 'next/server';
// renderHtmlToMp4 (the renderer) and buildTemplate (next section) are
// imported from your own modules; their paths depend on your project.

// Route segment config: run on Node.js, allow up to 60 seconds per request.
export const runtime = 'nodejs';
export const maxDuration = 60;

export async function POST(req: NextRequest) {
  const payload = await req.json();
  const html = buildTemplate(payload);
  const mp4 = await renderHtmlToMp4(html, {
    width: 1920,
    height: 1080,
    duration: 6,
    fps: 30,
  });
  return new NextResponse(mp4, {
    headers: { 'content-type': 'video/mp4' },
  });
}
```

The `runtime = 'nodejs'` is non-negotiable: the rendering uses headless Chromium, which does not run in Edge. The `maxDuration = 60` covers the slowest realistic render (six seconds at 1080p takes ~12s on warm infrastructure).
## The template

`buildTemplate(payload)` returns an HTML string. The trick is to keep the template as plain HTML inside a template literal, with `${...}` interpolation, not JSX:

```ts
// escapeHtml must come from a real escape utility (see note 1 below).
function buildTemplate(p: { name: string; metric: string; delta: number }) {
  return `<!doctype html>
<html><head><style>
  body { margin: 0; background: #f6f5f1; height: 100vh; display: grid; place-items: center; font-family: ui-sans-serif, system-ui; }
  .card { padding: 64px; background: white; border-radius: 24px; box-shadow: 0 30px 80px rgba(0,0,0,.1); }
  .hi { font-size: 24px; color: #6b6862; letter-spacing: .2em; text-transform: uppercase; }
  .name { font-size: 88px; font-weight: 800; margin: 12px 0; }
  .delta { font-size: 72px; font-weight: 800; color: #ff3b1f; font-variant-numeric: tabular-nums; }
</style></head>
<body><div class="card">
  <div class="hi">Personalized for</div>
  <div class="name">Hi ${escapeHtml(p.name)}.</div>
  <div>Your ${escapeHtml(p.metric)} is up <span class="delta" id="d">0%</span></div>
</div>
<script>
  var t=0;
  // Number(...) keeps a non-numeric payload from being injected as code.
  var T=${Number(p.delta)};
  addEventListener('hf-seek',function(e){
    t=e.detail.time;
    var u=Math.min(1, t/2);
    var e2=1-Math.pow(1-u,3);
    document.getElementById('d').textContent='+'+(T*e2).toFixed(1)+'%';
  });
</script></body></html>`;
}
```

Two things to notice:

1. **`escapeHtml`.** If `p.name` contains `<script>`, you want it as text, not code. Use a real escape utility (`he` package, or your framework's built-in).
2. **`addEventListener('hf-seek', ...)`.** The HyperFrames renderer dispatches `hf-seek` events at each frame's time. The template is a pure function of `t`: no `setInterval`, no animation loops. Frame N looks the same on every render.
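For readers who want to see what the escape step in note 1 amounts to, here is a minimal sketch. It is not the `he` package's implementation and is not meant to replace a maintained library; `toFiniteNumber` is a hypothetical companion for values such as `delta` that end up inside an inline script rather than in markup.

```javascript
// Minimal HTML escaper for text content and quoted attribute values.
// A sketch only: prefer a maintained library in production.
const HTML_ESCAPES = {
  '&': '&amp;',
  '<': '&lt;',
  '>': '&gt;',
  '"': '&quot;',
  "'": '&#39;',
};

function escapeHtml(value) {
  return String(value).replace(/[&<>"']/g, (ch) => HTML_ESCAPES[ch]);
}

// Hypothetical guard for numbers interpolated into an inline <script>:
// anything that is not a finite number becomes the fallback.
function toFiniteNumber(value, fallback = 0) {
  const n = Number(value);
  return Number.isFinite(n) ? n : fallback;
}

console.log(escapeHtml('<script>alert("x")</script>'));
console.log(toFiniteNumber('24.6'), toFiniteNumber('1;alert(1)'));
```

HTML escaping and script-context safety are separate problems: an escaped string is safe as element text, but a value placed inside `<script>` needs to be validated as the type the script expects.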
## The complete picture

<CodeTabs tabs={[ { label: "route.ts", code: `import { NextRequest, NextResponse } from 'next/server';
// renderHtmlToMp4 and buildTemplate are imported from your own modules.

export const runtime = 'nodejs';
export const maxDuration = 60;

export async function POST(req: NextRequest) {
  const payload = await req.json();
  const html = buildTemplate(payload);
  const mp4 = await renderHtmlToMp4(html, {
    width: 1920,
    height: 1080,
    duration: 6,
    fps: 30,
  });
  return new NextResponse(mp4, {
    headers: { 'content-type': 'video/mp4' },
  });
}` }, { label: "template.ts", code: `function buildTemplate(p) {
  return \`<!doctype html>...\`; // see above
}` }, { label: "Result", html: `<style>body{margin:0;background:#f6f5f1;height:100vh;display:grid;place-items:center;font-family:ui-sans-serif,system-ui;} .card{padding:48px;background:white;border-radius:18px;box-shadow:0 20px 60px rgba(0,0,0,.1);} .hi{font-size:18px;color:#6b6862;letter-spacing:.2em;text-transform:uppercase;} .name{font-size:64px;font-weight:800;margin:8px 0;} .delta{font-size:48px;font-weight:800;color:#ff3b1f;font-variant-numeric:tabular-nums;}</style> <div class="card"><div class="hi">Personalized for</div><div class="name">Hi Jordan.</div><div>Your MRR is up <span class="delta">+24.6%</span></div></div>` } ]} caption="The route, the template, and what the rendered MP4 frame looks like." />

## The four edge cases

The bugs that cost us a week:

### 1. Fonts

Custom fonts are async. Browsers will paint the page before the font loads, then re-paint when it arrives. The renderer captures frames; if it captures before the font loads, the MP4 has fallback typography.

The fix: wait for `document.fonts.ready` before the first frame.

```html
<script>
  document.fonts.ready.then(() => {
    window.__hf_ready__ = true;
  });
</script>
```

The HyperFrames SDK respects `window.__hf_ready__` and waits before capturing.

### 2. The Vercel filesystem

Vercel Functions have a read-only filesystem except for `/tmp` (512MB). If you cache rendered MP4s, write to `/tmp`. Better: stream the MP4 directly to the response and skip the cache. The route above does the latter.

### 3. Cold starts

A Vercel cold start that includes spinning up headless Chromium is ~3s. For end-user-facing renders, you want this on a warm instance: set `preferredRegion` and consider Vercel's Function Concurrency. For batch renders, cold starts amortize fine.

### 4. CORS

If the renderer fetches external assets (CSS, images, fonts from a CDN), CORS applies. Either:

- Inline all assets into the HTML (base64 images, embedded fonts via `data:` URIs).
- Set `Access-Control-Allow-Origin` headers on your CDN.

Inlining is more reliable; CORS is more flexible. Pick based on which assets change.

## Streaming versus buffered

The `renderHtmlToMp4` call above returns the full MP4 as a buffer. For larger renders (over 10s, or 4K), you want to stream chunks back as the encoder produces them:

```ts
return new NextResponse(streamHtmlToMp4(html, opts), {
  headers: { 'content-type': 'video/mp4' },
});
```

`streamHtmlToMp4` returns a `ReadableStream`. The browser can start playing the MP4 before the full encode completes, which is the right UX, provided the encoder writes a fragmented MP4 (in ffmpeg terms, `-movflags frag_keyframe+empty_moov`). `+faststart` on its own is not enough here: it moves the `moov` atom to the front only after encoding has finished, so it helps buffered files rather than live streams.

## Wiring into a UI

Once the route is live, the client side is trivial:

```ts
const res = await fetch('/api/render', {
  method: 'POST',
  headers: { 'content-type': 'application/json' },
  body: JSON.stringify({ name, metric, delta }),
});
// Surface render failures instead of loading an error body as video.
if (!res.ok) throw new Error(`Render failed: ${res.status}`);
const blob = await res.blob();
const url = URL.createObjectURL(blob);
videoRef.current.src = url;
```

The same pattern works for any [Next.js integration](/integrations/nextjs) or [Vercel deployment](/integrations/vercel). For higher-volume work, look at [batch CSV-driven rendering](/blog/batch-personalized-videos-from-csv): the route stays the same, you just call it N times.

## What this unlocks

The point of a render-route is that it turns video into a function call. Once `POST /api/render` exists, every other surface in your app (emails, dashboards, social cards) can fetch a personalized MP4 with no special pipeline. The HTML template is the contract; everything else is plumbing.
See also: the [developers overview](/developers) for the full SDK surface, and [the deterministic rendering manifesto](/blog/deterministic-video-manifesto) for the principles underneath the API.

---

# YouTube Shorts generator: programmatic Shorts from a template

URL: https://hyperframes.video/blog/youtube-shorts-generator
Published: 2026-05-08T09:00:00.000Z
Tags: youtube, shorts, automation, 9:16, tutorial
Author: kira-tanaka

YouTube Shorts is the format with the most leverage right now: Google is putting Shorts in front of long-form viewers, the algorithm is undertested, and the audience tolerates lower production quality than TikTok or Reels. For a developer who can ship one solid template, this is a free distribution channel.

The catch is volume. Shorts rewards posting daily, and "daily" plus "a CapCut session per video" plus "a job" do not coexist. The fix is the same fix as for any high-volume format: template once, render from data.

## The Shorts canvas

Shorts is `1080 × 1920`, MP4 with H.264 video and AAC audio. YouTube raised the length limit from 60 seconds to three minutes in October 2024; the template in this post runs 30 seconds. Unlike TikTok, YouTube's overlay budget is small: just a tap-to-pause hint and a subscribe pill in the bottom-right, both transparent. You can use almost the full canvas.

Safe area worth reserving: **bottom 120px** for the subscribe pill, **top 60px** for the "Shorts" badge.

## What works on Shorts vs. what works on TikTok

The two platforms feel similar but their audiences are calibrated differently. From running both in parallel for a year:

| Pattern | TikTok | Shorts |
|---|---|---|
| Talking head with caption | High floor, low ceiling | Higher ceiling, lower floor |
| Data-driven explainer | Medium | High (Shorts viewers carry over from long-form) |
| Product demo + voiceover | Medium | High |
| Trend remix | High | Low (Shorts doesn't reward trends) |
| Pure text-on-screen | Low | Medium |

The Shorts viewer is more patient and more interested in a payoff.
Lean into that with templates that have a setup-and-conclusion structure rather than a hook-and-loop. ## A starter template Five elements, vertically stacked, biased to the upper two-thirds of the canvas: <InlineSandbox html={`<!doctype html> <html><head><style> @property --p { syntax: '<percentage>'; initial-value: 0%; inherits: false; } body{margin:0;background:#0a0a0a;display:grid;place-items:center;min-height:100vh;font-family:ui-sans-serif,system-ui;} .s{position:relative;width:270px;height:480px;background:#0d0d12;border-radius:18px;overflow:hidden;color:#fff;} .bar{position:absolute;left:0;right:0;top:0;height:3px;background:linear-gradient(90deg,#ff3b1f var(--p),rgba(255,255,255,.15) var(--p));animation:bar 30s linear infinite;} @keyframes bar { to { --p: 100%; } } .badge{position:absolute;top:14px;left:14px;font:600 10px ui-monospace,monospace;letter-spacing:.25em;text-transform:uppercase;color:#fff;background:rgba(255,255,255,.1);padding:4px 8px;border-radius:6px;} .q{position:absolute;left:18px;right:18px;top:80px;font-weight:800;font-size:28px;line-height:1.05;letter-spacing:-.02em;} .a{position:absolute;left:18px;right:18px;top:230px;} .a .row{display:flex;justify-content:space-between;padding:10px 14px;background:rgba(255,255,255,.05);border:1px solid rgba(255,255,255,.1);border-radius:10px;margin-bottom:8px;} .a .row.win{background:rgba(255,59,31,.15);border-color:#ff3b1f;} .a .row b{font-weight:700;font-size:13px;} .a .row i{font-style:normal;font-size:12px;opacity:.7;} .cap{position:absolute;left:14px;right:14px;bottom:80px;text-align:center;font-size:11px;letter-spacing:.2em;text-transform:uppercase;opacity:.6;} .sub{position:absolute;right:10px;bottom:14px;background:#ff0000;color:#fff;font-size:10px;font-weight:700;padding:6px 10px;border-radius:14px;} </style></head><body> <div class="s"> <div class="bar"></div> <div class="badge">SHORTS · 0:30</div> <div class="q">Which database is fastest at 10M rows?</div> <div class="a"> <div 
class="row"><b>Postgres</b><i>3.4s</i></div> <div class="row win"><b>SQLite (WAL)</b><i>1.1s</i></div> <div class="row"><b>MySQL 8</b><i>2.8s</i></div> <div class="row"><b>DuckDB</b><i>0.9s</i></div> </div> <div class="cap">benchmark · q2 2026 · single-node</div> <div class="sub">SUBSCRIBE</div> </div> </body></html>`} height={520} caption="A data-driven Shorts template — progress bar, question, leaderboard, CTA." /> The template above shows the data-explainer pattern that performs disproportionately well on Shorts: question, leaderboard, conclusion. It is one HTML file with four variables (the question, the rows, the highlight row, the subscribe handle). ## The variable surface ```ts type ShortsData = { question: string; rows: { label: string; value: string; highlight?: boolean }[]; caption: string; duration: number; // seconds, default 30 accent: string; // default #ff3b1f }; ``` Five fields, one of them an array. A JSON file with thirty entries gives you thirty Shorts. <VariableKnobs html={`<style>body{margin:0;background:#000;display:grid;place-items:center;height:480px;font-family:ui-sans-serif,system-ui;} .s{position:relative;width:270px;height:480px;background:{{$BG}};color:#fff;border-radius:18px;overflow:hidden;} .q{position:absolute;left:18px;right:18px;top:60px;font-weight:800;font-size:30px;line-height:1.05;letter-spacing:-.02em;} .a{position:absolute;left:18px;right:18px;top:220px;} .row{display:flex;justify-content:space-between;align-items:center;padding:12px 14px;background:rgba(255,255,255,.06);border:1px solid rgba(255,255,255,.1);border-radius:10px;margin-bottom:8px;} .row.win{background:{{$ACCENT}}22;border-color:{{$ACCENT}};} .row b{font-size:13px;font-weight:700;} .row i{font-style:normal;font-size:12px;opacity:.7;} .row.win i{color:{{$ACCENT}};opacity:1;font-weight:700;} </style> <div class="s"> <div class="q">{{$QUESTION}}</div> <div class="a"> <div class="row"><b>{{$ROW1}}</b><i>{{$VAL1}}</i></div> <div class="row 
win"><b>{{$ROW2}}</b><i>{{$VAL2}}</i></div> <div class="row"><b>{{$ROW3}}</b><i>{{$VAL3}}</i></div> </div> </div>`} knobs={[ { name: "QUESTION", label: "Question", default: "Fastest static-site generator?" }, { name: "ROW1", label: "Row 1", default: "Next.js" }, { name: "VAL1", label: "Value 1", default: "8.2s" }, { name: "ROW2", label: "Row 2 (winner)", default: "Astro" }, { name: "VAL2", label: "Value 2", default: "2.1s" }, { name: "ROW3", label: "Row 3", default: "Eleventy" }, { name: "VAL3", label: "Value 3", default: "3.5s" }, { name: "ACCENT", label: "Accent", type: "color", default: "#ff3b1f" }, { name: "BG", label: "Background", type: "color", default: "#0d0d12" } ]} height={520} /> ## Captions: do them yourself YouTube's auto-captions are good. They are not good enough to drive a 28-second Shorts script where every word matters. Burn captions into the template: - **One line at a time**, swapped on word boundaries. - **Bottom-third position**, above the subscribe pill. - **High contrast**, dark backing rectangle, white text. See [burning subtitles into MP4](/blog/burn-subtitles-into-mp4) for the typography and timing rules. ## Deterministic batch from JSON The render command, conceptually: ```bash hf render \ --template shorts-template.html \ --data shorts-batch.json \ --out ./shorts/ \ --size 1080x1920 \ --fps 30 ``` One MP4 per JSON entry, named after a slug. From there, a YouTube Studio bulk upload (or the YouTube API if you want to go further) gets them scheduled. ## What scales, what doesn't This pipeline scales for explainers, leaderboards, listicles, "did you know" cards, and any data-first content. It does not scale for face-to-camera content — for that, you still need a person and a camera. The right call is to run both pipelines in parallel: one for the templated data-driven shorts (high volume, moderate floor), one for face-to-camera (low volume, high ceiling). The data-driven stream warms the algorithm for the face-to-camera stream to land on. 
[Open the playground](/playground), build one Shorts template, queue thirty for next week. --- # Motion graphics in 80 lines URL: https://hyperframes.video/blog/motion-graphics-in-80-lines Published: 2026-05-07T15:30:00.000Z Tags: tutorial, css, animation, design Author: marcus-okafor Show, don't tell. So here is the file. Eighty lines, complete, no dependencies, deterministic when run through `hyperframes render`. It produces a five-second title sequence with bouncy character animation, a parallax backdrop, a signal-color accent line, and a cinematic ease-out. I will walk through every decision after the listing, but I want you to see the whole thing first, because the surprise is that there is no surprise. ```html <!DOCTYPE html> <html> <head> <style> :root { --cream: #f6f5f1; --ink: #0a0a0a; --signal: #ff3b1f; --ease: cubic-bezier(.16, 1, .3, 1); } html, body { margin: 0; height: 100vh; overflow: hidden; background: var(--cream); color: var(--ink); font-family: "Newsreader", Georgia, serif; } .stage { position: relative; height: 100vh; display: grid; place-items: center; } .backdrop { position: absolute; inset: 0; background: radial-gradient(ellipse at 30% 40%, color-mix(in oklab, var(--signal) 14%, transparent), transparent 60%); animation: drift 7s var(--ease) both; animation-play-state: paused; animation-delay: calc(var(--hf-time, 0s) * -1); } @keyframes drift { from { transform: scale(1.1) translateX(-3%); opacity: 0; } 20% { opacity: 1; } to { transform: scale(1.0) translateX(3%); opacity: 1; } } h1 { position: relative; z-index: 1; font-size: clamp(56px, 9vw, 140px); font-weight: 500; letter-spacing: -0.025em; line-height: 0.95; margin: 0; max-width: 12ch; text-align: center; } .ch { display: inline-block; animation: rise 800ms var(--ease) both; animation-play-state: paused; animation-delay: calc((var(--hf-time, 0s) * -1) + var(--d, 0s)); } @keyframes rise { from { opacity: 0; transform: translateY(40px) rotateX(70deg); } to { opacity: 1; transform: 
translateY(0) rotateX(0); }
  }
  em { color: var(--signal); font-style: italic; }
  .rule {
    position: absolute; bottom: 16vh; left: 50%;
    width: 0; height: 2px; background: var(--signal);
    transform: translateX(-50%);
    animation: stretch 1.2s var(--ease) 2.4s both;
    animation-play-state: paused;
    animation-delay: calc((var(--hf-time, 0s) * -1) + 2.4s);
  }
  @keyframes stretch { to { width: 22vw; } }
  .caption {
    position: absolute; bottom: 10vh; left: 50%;
    transform: translateX(-50%);
    font-family: ui-monospace, monospace; font-size: 12px;
    letter-spacing: 0.22em; text-transform: uppercase;
    color: color-mix(in oklab, var(--ink) 60%, transparent);
    opacity: 0;
    animation: fade 600ms var(--ease) 3s both;
    animation-play-state: paused;
    animation-delay: calc((var(--hf-time, 0s) * -1) + 3s);
  }
  @keyframes fade { to { opacity: 1; } }
</style>
</head>
<body data-duration="5">
  <div class="stage">
    <div class="backdrop"></div>
    <h1 id="title">A film for <em>frame zero</em>.</h1>
    <div class="rule"></div>
    <div class="caption">a hyperframes original</div>
  </div>
  <script>
    // Walk text nodes rather than rewriting innerHTML, so the <em> markup
    // is left intact and only visible characters are wrapped.
    const t = document.getElementById("title");
    let i = 0;
    const split = (node) => {
      if (node.nodeType !== Node.TEXT_NODE) return [...node.childNodes].forEach(split);
      const frag = document.createDocumentFragment();
      for (const c of node.textContent) {
        const delay = (i++ * 35) + 200; // stagger in ms; spaces advance the index too
        if (/\s/.test(c)) { frag.append(c); continue; }
        const span = document.createElement("span");
        span.className = "ch";
        span.style.setProperty("--d", `${delay}ms`);
        span.textContent = c;
        frag.append(span);
      }
      node.replaceWith(frag);
    };
    split(t);
  </script>
</body>
</html>
```

That is the whole composition. Save it as `title.html`, run `npx hyperframes render title.html --duration 5`, and you have an MP4. Now let us go through the design choices.

## The CSS variable trick

The single most important pattern in this file is the use of `--hf-time`. HyperFrames sets this variable on `:root` for every frame. CSS animations are paused (`animation-play-state: paused`) and their `animation-delay` is computed from `--hf-time`. The result is that every animation is driven by the engine's clock, not the browser's wall clock. This is what makes the render deterministic.

Note the formula: `animation-delay: calc((var(--hf-time, 0s) * -1) + var(--start, 0s))`.
The negative time scrubs the animation forward; the start offset positions when it begins. When `--hf-time` is 0s, the animation is at its start. When `--hf-time` is 2.4s and the start is 2.4s, the animation has just begun. This idiom is in every HyperFrames composition I write. ## The character split The script at the bottom is the only JavaScript in the file, and it runs exactly once at page load. It walks the title text, wraps every non-space character in a span, and gives each span a staggered `--d`. The CSS animation rule reads `--d` and offsets the rise by it. This pattern — split characters, stagger animation-delay — is the most reused motion pattern in editorial design. You see it on every well-designed news graphic. The reason it works is that the human eye reads characters left-to-right at roughly 35-50ms per character, and when the animation matches that cadence, the title appears to *land into place* rather than fade in as a block. I have tried this with libraries — SplitText from GreenSock, splitting in JavaScript at runtime, splitting with CSS pseudo-elements. The plain HTML approach beats all of them for clarity and for render speed. The DOM is built once, the animation is declarative, the engine seeks into it. ## The easing curve, specifically `cubic-bezier(.16, 1, .3, 1)` is the most important number in this composition. It is the easing curve I use as my default for editorial motion. It comes from a long line of "cinematic" eases — Apple's Big Sur curve, the Material 3 "emphasized" easing, the unofficial "EaseOutExpo" that motion designers have shared on Twitter for a decade. The key feature of the curve is that it spends most of its travel in the first 30% of the duration. The element moves fast, then slows, then *settles*. Settling is what makes motion feel intentional. Linear motion feels mechanical; quadratic motion feels generic; this curve feels like something a hand placed there. 
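The claim that this curve spends most of its travel early can be checked numerically. The following is a minimal sketch, not part of HyperFrames: `cubicBezier` is a hypothetical helper that evaluates a CSS `cubic-bezier()` by bisection, assuming both x control values lie in the 0 to 1 range (as CSS requires).

```javascript
// Build an easing function from CSS cubic-bezier(x1, y1, x2, y2) control values.
// Assumption: x1 and x2 are within [0, 1], so x(s) is monotonic and bisection works.
function cubicBezier(x1, y1, x2, y2) {
  // One coordinate of the curve at parameter s; the endpoints are fixed at 0 and 1.
  const coord = (a, b, s) =>
    3 * (1 - s) * (1 - s) * s * a + 3 * (1 - s) * s * s * b + s * s * s;

  return function ease(time) {
    if (time <= 0) return 0;
    if (time >= 1) return 1;
    let lo = 0;
    let hi = 1;
    // Find the parameter s whose x coordinate equals the requested time fraction.
    for (let i = 0; i < 40; i++) {
      const mid = (lo + hi) / 2;
      if (coord(x1, x2, mid) < time) lo = mid;
      else hi = mid;
    }
    return coord(y1, y2, (lo + hi) / 2);
  };
}

const settle = cubicBezier(0.16, 1, 0.3, 1);
for (const percent of [10, 30, 50]) {
  const travel = settle(percent / 100) * 100;
  console.log(`${percent}% of duration: ${travel.toFixed(1)}% of travel`);
}
```

At the 30% mark the curve has already covered well over 80% of its travel, which is the "rush, then settle" shape described above. Swapping in other control values gives a quick way to compare candidates before committing to a re-render.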
I have an entire post coming on [easing that looks like money](/blog/easing-that-looks-like-money), but the short version is: every editorial motion designer has three or four curves in their muscle memory, and changing one of them changes the entire personality of a video. This curve is one of mine. It is not the only correct answer. It is a correct answer. ## The backdrop The backdrop is a single `radial-gradient`, color-mixed at 14% opacity with the signal color. The reason it works is that it is *subtle*. The role of the backdrop is not to be seen. It is to give the eye somewhere soft to land before the title arrives, and to add chromatic warmth to a frame that is otherwise pure cream and ink. The `drift` animation moves the gradient slightly over seven seconds. The displacement is 6% of the viewport — large enough to register as motion, small enough that you do not catch it consciously. This is the difference between motion graphics that feel alive and motion graphics that feel static. Backgrounds in the best editorial work are always moving, but you have to look to see it. ## The rule and the caption The rule (the 22vw signal-colored line) is the second beat of the composition. It enters at 2.4 seconds — after the title has settled but before the eye gets bored. The width stretches from 0 to 22vw over 1.2 seconds with the same cinematic ease. It is a single `<div>` and a single keyframe. The caption ("a hyperframes original") arrives at 3 seconds. It is monospaced, all-caps, letter-spaced wide, and dimmed to 60% ink. The combination of the rule and the caption is the editorial equivalent of a director's credit on a film — small, confident, in service to the title above. The font shift from serif to mono is the visual equivalent of a different voice. I want to point out that nothing in this composition is technically impressive. There is no shader. There is no Lottie. There is no WebGL. There is not even a JavaScript animation library. 
The whole file is HTML, CSS, and one small script. The motion designers I respect most can produce work like this in twenty minutes; the rest of us can produce it in an hour with a reference. ## What 80 lines buys you I want to land the post on this point. Eighty lines of HTML is not a small amount of code, but it is a small amount of *artifact*. The compositions I see motion designers ship in After Effects are typically project files that are megabytes large, with sixty layers, two dozen pre-comps, and a maze of expressions. The output is gorgeous. The artifact is unreviewable. This 80-line file, by contrast, fits in a pull request. A reviewer can read it. An agent can edit it. (See the [After Effects comparison](/compare/after-effects) for the longer version of this argument.) A new team member can understand it. The whole composition is grep-able. If we change `--signal` once in the file, every appearance updates. If we want to A/B test the easing curve, we change one cubic-bezier and re-render. This is the trade I keep coming back to: After Effects optimizes for the moment of authorship. HTML optimizes for the next ten years of the file's life. Both are valid. But increasingly, the second one is the one that matters, because the file is going to be edited by someone other than its original author, and that someone might be a model. ## Variations from the same template Once you have the pattern — `--hf-time` clock, paused animations, calc-driven delays, character splits, cinematic ease — every new composition is a variation. The 80 lines above turn into: - A 6-second product reveal by swapping the title and adding a third beat. - A 15-second testimonial by adding a portrait image and an inset quote. - A nine-second loop for a social ad by changing the duration and the resolution. - A vertical 9:16 by tweaking the `clamp()` and the alignment. None of these are new compositions, structurally. They are the same file, with different parameters. 
The substrate is text, the changes are diffs, and every variation is in the same git history as the original. This is the unlock that makes HTML the right substrate for motion: the file is the design system, the design system is the file, and the file is something you can keep changing for years. ## Why the file is exactly this size Eighty lines is not a magic number, but it is a deliberate one. A shorter composition starts to feel undercooked — title appears, title disappears, done. A longer composition starts to require structure: helper functions, sub-components, a build step. The eighty-line zone is where a single human can hold the entire piece in their head at once, and where an LLM can rewrite it without losing context. I have found, across hundreds of compositions I have shipped this past year, that the sweet spot for a single-file editorial composition is somewhere between 60 and 140 lines. Below 60 the composition feels thin. Above 140 the file starts to drift; bugs creep in; the agent loop slows down because the model has to keep more in working memory. When a composition wants to be larger than 140 lines, that is the signal to start extracting. A reusable lower third becomes its own file. A chart component lives in `components/chart.html` and is included via fetch. The brand tokens live in `brand.css`. The discipline is the same as software: extract when complexity demands it, not before. ## A note on what is missing If you read the listing carefully, you will notice things that are not there. There is no JavaScript animation library. No GSAP, no Anime, no Motion. There is no Three.js even though the title has a `rotateX` effect that looks 3D-ish (it is just CSS 3D transform). There is no canvas, no SVG (except indirectly through the way fonts render). There is no requestAnimationFrame loop. There is no shader. This is intentional. 
The composition demonstrates that for an enormous category of editorial motion graphics — perhaps the majority of what motion designers actually ship in their day-to-day — the browser's built-in primitives are enough. CSS animations plus a single character-split script get you most of the way to "professional." The frameworks come in when you have a specific need the primitives cannot meet, and in editorial work that need is rare.

I am not arguing against frameworks. GSAP is wonderful. Anime is delightful. Lottie is the right answer for some specific shapes of work. But the reflex of "I need motion, therefore I need a framework" is one of the things that makes motion design feel inaccessible to web developers who could otherwise produce excellent work.

The primitives are sitting there. You can start. Open your editor — or drop the listing straight into the [HyperFrames playground](/playground) and skip the install. Run the render. Then change one number. See what moves.

---

# The easing curves cheat sheet (code, not theory)

URL: https://hyperframes.video/blog/easing-curves-cheatsheet
Published: 2026-05-07T14:00:00.000Z
Tags: easing, css, animation, reference
Author: marcus-okafor

I have written about [easing that looks like money](/blog/easing-that-looks-like-money) at length. That piece is about taste; this one is about reference. Twelve curves, side by side, with the numbers worth typing into a CSS file. Bookmark it.

## The twelve

## Linear

```css
linear
```

The dot moves at constant velocity. Useful for: backgrounds, scroll progress, anything where motion is communication rather than performance. Avoid for: literally any other entrance or exit.

## Ease

```css
ease /* cubic-bezier(.25, .1, .25, 1) */
```

The CSS default. An asymmetric in-out: a brief ramp up, then a long deceleration. Used everywhere by default; opinion-free, slightly cinematic. Replace it with something tuned for the moment.

## Settle (a.k.a. expo-out variants)

```css
cubic-bezier(.16, 1, .3, 1)
```

My single most-used curve.
Cinematic ease-out: rushes in, settles slowly. Pair with 600-900ms duration for editorial motion.

## Ease-in

```css
cubic-bezier(.42, 0, 1, 1)
```

Slow start, fast finish. Reads as "falling" or "departing." Use for exits, not entrances.

## Ease-out

```css
cubic-bezier(0, 0, .58, 1)
```

Fast start, slow finish. The most common useful curve in practice — softer than settle, more energetic than ease.

## Ease-in-out

```css
cubic-bezier(.42, 0, .58, 1)
```

Symmetric. Slow on both ends. Used heavily; often slightly mechanical-feeling. A steeper in-out, still symmetric but with softer ends and a faster middle, usually feels better; try `cubic-bezier(.65, 0, .35, 1)`.

## Back

```css
cubic-bezier(.34, 1.56, .64, 1)
```

The "overshoot" curve. The element arrives, goes past, snaps back. The `1.56` is the overshoot amount; lower for subtle, higher for cartoonish.

## Bounce

```css
/* CSS has no bounce easing function, but you can fake it with multiple keyframes.
   This version rebounds once; add more stops for more bounces. */
@keyframes bounce {
  0% { transform: translateY(-300px); }
  60% { transform: translateY(0); }
  75% { transform: translateY(-30px); }
  100% { transform: translateY(0); }
}
```

Multiple bounces of decaying amplitude. Used in playful contexts; rarely the right call for a corporate brand.

## Elastic

```javascript
// Not a cubic-bezier: needs keyframes or a JS easing function.
function elastic(t) {
  const c = (2 * Math.PI) / 3;
  return t === 0 ? 0
    : t === 1 ? 1
    : Math.pow(2, -10 * t) * Math.sin((t * 10 - 0.75) * c) + 1;
}
```

A spring with many oscillations. Reads as "rubber band." Use with extreme caution — looks like an animation library demo.

## Expo-out

```css
cubic-bezier(.19, 1, .22, 1)
```

More aggressive than settle. The element *snaps* in and rests immediately. Pair with shorter durations (400-600ms).

## Circ-out

```css
cubic-bezier(0, .55, .45, 1)
```

A constant-rate-of-acceleration feel. Reads as smoother than ease-out at small distances. Good for hovers, button feedback.

## Quint-out

```css
cubic-bezier(.22, 1, .36, 1)
```

Between settle and expo-out.
Slightly more rest at the end. Good middle ground for "make this feel cinematic without committing to a long tail."

## The chart

A comparison everybody finds clarifying — every curve plotted on the same axes. The exercise is to watch it once, then close it and pick the three you will use this quarter.

## How to actually use this

The honest advice for picking from twelve options: do not. Pick three.

1. **One for entrances.** Probably `cubic-bezier(.16, 1, .3, 1)` (settle).
2. **One for exits.** Probably `cubic-bezier(.7, 0, .84, 0)` (ease-in-expo).
3. **One for hero moments.** Probably `cubic-bezier(.34, 1.56, .64, 1)` (back).

Use those three on everything. Reserve the other nine for "I am making a deliberate exception today." The reason elite motion design feels consistent is that elite motion designers picked their three and stopped.

The [easing-that-looks-like-money](/blog/easing-that-looks-like-money) post has the longer version of this argument with the six curves I actually use across my own work.

## Bookmark this

If you maintain a design system, paste the cubic-bezier values into a `tokens.css`:

```css
:root {
  --ease-settle: cubic-bezier(.16, 1, .3, 1);
  --ease-exit: cubic-bezier(.7, 0, .84, 0);
  --ease-back: cubic-bezier(.34, 1.56, .64, 1);
  --ease-standard: cubic-bezier(.65, 0, .35, 1);
}
```

Then every transition in your codebase reads `var(--ease-settle)`. When you tune one curve, every component using it updates. Same play [easing-as-tokens](/blog/three-ts-of-editorial-motion) covers in detail.

## A reference, not a recipe

A cheatsheet is for occasional consultation, not a daily-driver decision tool. If you find yourself opening this list every time you write a `transition`, you have not internalized enough of them. Pick the three. Use them. The cheatsheet exists for the day you need a fourth.
--- # Build an animated countdown timer in 40 lines of HTML URL: https://hyperframes.video/blog/animated-countdown-timer-html Published: 2026-05-07T09:00:00.000Z Tags: html, countdown, tutorial, marketing Author: ren-park If you have ever shipped a [product launch](/use-cases/product-launches), you know the rhythm: someone in marketing wants a countdown timer for the email, for the landing page, for the Twitter video, and for the in-store screen. Four formats, four different tools, four chances for the wrong date to ship. There is a simpler version where the timer is one HTML file and a target date. Render it to MP4 for the social cuts, embed it on the page, ship it. Forty lines, four variables, done. ## What a countdown actually is Strip the polish away and a countdown is `target_timestamp - now`, formatted as DD:HH:MM:SS, updated once per second. The polish is everything else: tabular numerals so digits do not dance, a subtle scale on tick, a label row underneath, padding that keeps the layout from reflowing when "9 days" becomes "10 days." If you build it in code, the polish is reusable. If you build it in After Effects, the polish is one designer's afternoon. ## The four-cell layout A countdown reads cleanest as four equal cells: days, hours, minutes, seconds. Equal widths means the eye can compare them; tabular-num digits means a `7` and a `1` occupy the same width. CSS Grid with `grid-template-columns: repeat(4, 1fr)` and `font-variant-numeric: tabular-nums` gets you there in two declarations. The label under each cell is small, uppercase, and tracked-out (`letter-spacing: .3em`). It should be quieter than the digits — the digits are the message, the labels are the legend. ## The variables that matter The four knobs a marketer will actually touch: - Target date (ISO string) - Headline ("Launching in", "Tickets on sale", "Until close") - Accent color - Background Everything else — cell radius, gap, label tracking — is a design decision. 
Lock it in the template; do not expose it. <VariableKnobs html={`<style>body{margin:0;background:{{$BG}};color:white;height:100vh;display:grid;place-items:center;font-family:ui-sans-serif,system-ui;} .cd{display:flex;gap:20px;} .cell{background:rgba(255,255,255,.06);border:1px solid rgba(255,255,255,.12);border-radius:16px;padding:24px;min-width:120px;text-align:center;} .n{font-size:80px;font-weight:800;letter-spacing:-.04em;color:{{$ACCENT}};font-variant-numeric:tabular-nums;} .l{font-size:11px;letter-spacing:.3em;text-transform:uppercase;margin-top:8px;opacity:.6;} .t{position:absolute;top:60px;left:50%;transform:translateX(-50%);font-size:13px;letter-spacing:.35em;text-transform:uppercase;opacity:.5;}</style> <div class="t">{{$TITLE}}</div> <div class="cd"><div class="cell"><div class="n">07</div><div class="l">days</div></div><div class="cell"><div class="n">14</div><div class="l">hours</div></div><div class="cell"><div class="n">22</div><div class="l">minutes</div></div><div class="cell"><div class="n">39</div><div class="l">seconds</div></div></div>`} knobs={[ { name: "TITLE", label: "Headline", default: "Launching in" }, { name: "ACCENT", label: "Digit color", type: "color", default: "#ff3b1f" }, { name: "BG", label: "Background", type: "color", default: "#0a0a0a" } ]} /> ## The tick animation A countdown that updates without any visual feedback feels broken — the eye keeps watching for a flicker. The minimum acceptable feedback is a 100ms scale pulse on the seconds cell every second: scale to 1.04, ease back. Anything more elaborate (flip-cards, slot-machine rolls) is a brand decision; the pulse is a usability decision. Implement it as a CSS animation triggered by a class toggle, not a transition. Transitions are interrupted by re-renders; an animation keyframe runs to completion regardless. 
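The arithmetic described earlier, `target_timestamp - now` formatted into four cells, can be sketched as a pure function. This is an illustration only: `countdownParts` and its arguments are hypothetical names, and the sketch assumes the timer freezes at zero once the target has passed.

```javascript
// Split the time remaining until a target into the four zero-padded display cells.
// Assumption: the countdown freezes at zero instead of counting past the target.
function countdownParts(targetIso, nowMs) {
  const targetMs = Date.parse(targetIso); // ISO string with a trailing Z, so UTC
  const totalSeconds = Math.max(0, Math.floor((targetMs - nowMs) / 1000));
  const pad = (n) => String(n).padStart(2, "0");
  return {
    days: pad(Math.floor(totalSeconds / 86400)),
    hours: pad(Math.floor((totalSeconds % 86400) / 3600)),
    minutes: pad(Math.floor((totalSeconds % 3600) / 60)),
    seconds: pad(totalSeconds % 60),
  };
}

// Fixed "now" so the example is reproducible; in a page this would be Date.now().
const now = Date.parse("2026-05-07T09:00:00Z");
const parts = countdownParts("2026-05-14T23:22:39Z", now);
console.log(`${parts.days}:${parts.hours}:${parts.minutes}:${parts.seconds}`);
```

Passing the current time in as an argument, rather than reading the clock inside the function, keeps the output reproducible, which matters when the same template is rendered frame by frame. The tick pulse would then be a class toggle on the seconds cell whenever `parts.seconds` changes.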
## Render once, ship four places The reason this matters: a single HTML countdown template renders to MP4 for social, lives on the landing page as an iframe, and exports as a still for the email. One source of truth. The [HyperFrames render pipeline](/tools/html-to-video) is built for exactly this pattern — same template, multiple aspect ratios, deterministic output. The 16:9 cut for YouTube and the 9:16 cut for Reels come from the same forty lines. ## Edge cases worth knowing A few things will bite you the first time: 1. **Timezones.** Always store the target as an ISO string with `Z` (UTC). Render in the viewer's timezone. Compare apples to apples on the server. 2. **The "0 days" state.** Decide whether the timer goes to negative ("Live for 2:14:33") or freezes at zero. Both are valid; both should be a one-line decision in the template. 3. **Reduced motion.** Respect `prefers-reduced-motion`. Disable the pulse; the digits still update. None of these are hard. All of them are easy to forget the first time. ## When code beats a tool For a single countdown rendered once, a motion tool wins on speed. For a countdown that gets rebranded twice, re-cut three ways, and re-rendered every Friday until the launch, code wins by an order of magnitude. The dividing line is usually around the second iteration. If you are building a launch, a sale, or any event with a date — start in code. The first version takes longer; every version after takes minutes. Open the [playground](/playground), paste the timer example, change the date. --- # Social cards in 1:1, 16:9, and 9:16 from one template URL: https://hyperframes.video/blog/social-media-cards-every-ratio Published: 2026-05-06T15:00:00.000Z Tags: social, templates, responsive, marketing Author: hf-team A campaign that ships to Instagram, TikTok, YouTube, LinkedIn, and X needs five aspect ratios from the same content. Most teams handle this with five After Effects files that get out of sync as the brand evolves. 
There is a faster path: one HTML template that responds to its container, three render passes.

## The three aspects that cover most

The math: five common ratios collapse to three layout modes.

- **16:9 (YouTube, LinkedIn, X video)** — wide hero, side-by-side type and visual.
- **1:1 (Instagram feed, X image)** — stacked, type-heavy, central focus.
- **9:16 (Reels, TikTok, Stories)** — vertical, large text, motion in the upper third.

Get these three right and you cover 95% of distribution.

## Container queries do the work

The CSS feature that makes this possible: `@container`. Rather than checking the viewport, you check the *element's* size. The template wraps everything in a container; the children's layout responds to whatever aspect ratio that container has.

```css
.frame {
  container-type: size;
  container-name: frame;
}

@container frame (aspect-ratio > 16/10) {
  .stage { grid-template-columns: 1fr 1fr; }
  .type { font-size: 64px; text-align: left; }
}

@container frame (aspect-ratio < 1.1) and (aspect-ratio > .9) {
  .stage { grid-template-columns: 1fr; }
  .type { font-size: 72px; text-align: center; }
}

@container frame (aspect-ratio < 9/10) {
  .stage { grid-template-rows: 1fr 1fr; }
  .type { font-size: 84px; text-align: center; }
}
```

Three container queries, three layouts. The renderer sizes the container per output; the layout follows.

## The shared content

What stays the same across aspects:

- Headline copy
- Brand mark
- Color palette
- Animation timing

What changes:

- Grid direction (column vs row)
- Type scale
- Element ordering (logo above the headline on 9:16, beside it on 16:9)

If you find yourself changing the *content* between aspects, you have two templates, not one. Stop and rethink.
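The thresholds in the container queries above can be mirrored outside CSS, for example to check a render size before queueing it. A minimal sketch under that assumption; `layoutModeFor` and the mode names are hypothetical, not something the renderer provides.

```javascript
// Mirror of the three @container conditions, usable as a pre-render sanity check.
function layoutModeFor(width, height) {
  const ratio = width / height;
  if (ratio > 16 / 10) return "wide";                 // aspect-ratio > 16/10
  if (ratio < 1.1 && ratio > 0.9) return "square";    // between .9 and 1.1
  if (ratio < 9 / 10) return "vertical";              // aspect-ratio < 9/10
  return "unmatched"; // sizes such as 4:3 fall between the queries
}

for (const [w, h] of [[1920, 1080], [1080, 1080], [1080, 1920], [1440, 1080]]) {
  console.log(`${w}x${h}: ${layoutModeFor(w, h)}`);
}
```

The last case is the useful one: a 4:3 container matches none of the three queries, so it would fall back to whatever the base styles define. Checking for `"unmatched"` before a render catches that early.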
## A concrete example The same campaign card, rendered three ways: <CodeTabs tabs={[ { label: "1:1 (square)", html: `<style>body{margin:0;background:#f6f5f1;font-family:ui-sans-serif,system-ui;}.f{aspect-ratio:1/1;width:100vmin;display:grid;place-items:center;padding:48px;text-align:center;}.h{font-size:72px;font-weight:900;letter-spacing:-.04em;line-height:1;}.s{font-size:18px;margin-top:24px;color:#6b6862;letter-spacing:.2em;text-transform:uppercase;}</style><div class="f"><div><div class="h">Render anywhere.</div><div class="s">video, in your stack</div></div></div>` }, { label: "16:9 (wide)", html: `<style>body{margin:0;background:#f6f5f1;font-family:ui-sans-serif,system-ui;}.f{aspect-ratio:16/9;width:100vw;max-height:100vh;display:grid;grid-template-columns:1fr 1fr;gap:32px;padding:48px;align-items:center;}.h{font-size:84px;font-weight:900;letter-spacing:-.04em;line-height:1;}.s{font-size:20px;margin-top:18px;color:#6b6862;letter-spacing:.2em;text-transform:uppercase;}.r{height:240px;background:#ff3b1f;border-radius:32px;}</style><div class="f"><div><div class="h">Render anywhere.</div><div class="s">video, in your stack</div></div><div class="r"></div></div>` }, { label: "9:16 (vertical)", html: `<style>body{margin:0;background:#f6f5f1;font-family:ui-sans-serif,system-ui;}.f{aspect-ratio:9/16;height:100vh;display:grid;grid-template-rows:1fr 1fr;padding:48px;text-align:center;}.h{font-size:80px;font-weight:900;letter-spacing:-.04em;line-height:1;align-self:end;}.s{font-size:18px;margin-top:32px;color:#6b6862;letter-spacing:.2em;text-transform:uppercase;}.r{height:80%;align-self:center;background:#ff3b1f;border-radius:32px;}</style><div class="f"><div><div class="h">Render anywhere.</div><div class="s">video, in your stack</div></div><div class="r"></div></div>` } ]} caption="One semantic content set, three layouts. Switch tabs to see each." 
/>

## Render specs per platform

The dimensions that matter, by platform:

| Platform | Aspect | Resolution | Duration |
|---|---|---|---|
| Instagram feed | 1:1 | 1080×1080 | up to 60s |
| Instagram Reels | 9:16 | 1080×1920 | 15-90s |
| TikTok | 9:16 | 1080×1920 | 15-180s |
| YouTube | 16:9 | 1920×1080 | any |
| YouTube Shorts | 9:16 | 1080×1920 | up to 60s |
| LinkedIn | 1:1 or 16:9 | 1080×1080 or 1920×1080 | up to 10min |
| X video | 16:9 | 1280×720 | up to 140s |

Pin these in your render configuration. A render at the wrong dimensions gets re-encoded by the platform, which means quality loss on top of the size loss.

## The render matrix

In CI, the matrix is one row per (template, aspect) pair:

```yaml
strategy:
  matrix:
    # Quoted so the parser treats each ratio as a string rather than a number
    aspect: ["1:1", "16:9", "9:16"]
steps:
  - run: pnpm render --template campaign.html --aspect ${{ matrix.aspect }} --out out/${{ matrix.aspect }}.mp4
```

Three renders, three minutes total on a decent runner. Outputs upload to S3; the publishing tool picks the right MP4 for the right platform. See [the GitHub Actions integration](/integrations/github-actions) for the full workflow.

## The brand-time benefit

The reason this is worth the upfront work: when the brand evolves, you fix one template. The next campaign ships in the new style across every aspect ratio simultaneously. No "we forgot to update the TikTok cut" Slack message. No "the YouTube version still has the old logo" thread.

For [marketing teams shipping weekly](/use-cases/marketing), this compounds fast. The first month of one-template-many-aspects feels marginal. The third month, you stop noticing that the social cuts even need to be made. They are build artifacts.

The pattern lives outside of social, too — [emails](/integrations/nextjs), [landing pages](/), [out-of-home](/use-cases) — every surface gets a render at its native dimensions. Same source, many outputs. The static-site-generator playbook, for video.
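One way to pin the platform table in configuration is as plain data, then derive the distinct render sizes from it. This is a sketch only: `PLATFORM_SPECS` and `uniqueRenderSizes` are illustrative names, and the values are copied from the table in this post rather than from any platform API.

```javascript
// Platform specs from the table above, one entry per (platform, resolution) pair.
const PLATFORM_SPECS = [
  { platform: "Instagram feed", width: 1080, height: 1080 },
  { platform: "Instagram Reels", width: 1080, height: 1920 },
  { platform: "TikTok", width: 1080, height: 1920 },
  { platform: "YouTube", width: 1920, height: 1080 },
  { platform: "YouTube Shorts", width: 1080, height: 1920 },
  { platform: "LinkedIn", width: 1080, height: 1080 },
  { platform: "LinkedIn", width: 1920, height: 1080 },
  { platform: "X video", width: 1280, height: 720 },
];

// Group platforms by output size so each distinct size is rendered only once.
function uniqueRenderSizes(specs) {
  const sizes = new Map();
  for (const { platform, width, height } of specs) {
    const key = `${width}x${height}`;
    if (!sizes.has(key)) sizes.set(key, []);
    sizes.get(key).push(platform);
  }
  return sizes;
}

for (const [size, platforms] of uniqueRenderSizes(PLATFORM_SPECS)) {
  console.log(`${size}: ${platforms.join(", ")}`);
}
```

Grouping by size shows how few renders the table actually needs: several platforms share one vertical file, and only X asks for a separate 720p cut.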
---

# Stripe-style payment success animation in pure CSS

URL: https://hyperframes.video/blog/payment-success-animation-css
Published: 2026-05-06T09:00:00.000Z
Tags: css, animation, stripe, tutorial, ui
Author: marcus-okafor

If you've ever paid for anything online in the last five years, you have seen this animation: the ring closes, a checkmark draws, the whole thing settles with a single elastic overshoot, and you exhale. Stripe's checkout team got it right; everyone copied it; now it is the visual grammar of "your money moved."

There are two reasons to rebuild this in your own code instead of leaning on a Lottie file. First, it is small — under 80 lines of CSS. Second, you can render it to MP4 for a launch video, a help article, or an onboarding flow without touching After Effects.

## What the animation actually does

Three timed beats, in this order:

1. **The ring traces** (0.0 – 0.5s) — a circle stroke draws from the top, clockwise, all the way around. Eases in slow, eases out faster.
2. **The checkmark draws** (0.4 – 0.7s) — overlapping with the ring's last 100ms, so the eye reads continuity, not sequence.
3. **The whole thing settles** (0.7 – 0.95s) — a single elastic overshoot, ~6%, then back. This is the "exhale" beat.

Total runtime: under a second. Anything slower and it feels like a loading spinner instead of a confirmation.

## The ring (technique)

A stroked SVG `<circle>` with `stroke-dasharray` set to the circumference and `stroke-dashoffset` animated to zero. The rotation trick: start the stroke from the top instead of the right with `transform="rotate(-90 50 50)"`, which rotates the circle about its own center rather than the SVG origin.

## The checkmark (technique)

An SVG `<path>` with `stroke-dasharray` and `stroke-dashoffset` — same primitive as the ring, applied to a polyline. The path is just two line segments: down-and-right, then up-and-right.

## The elastic settle (technique)

CSS `transform: scale()` with a custom easing curve.
The cubic-bezier that gives that "snappy overshoot" without feeling cartoonish:

```css
transition: transform 0.25s cubic-bezier(0.34, 1.56, 0.64, 1);
```

The `1.56`, the y value of the first control point, is what pushes the scale past 1.0 before settling. Drop it to `1.2` for subtle, push to `1.8` for cartoony.

## The full source

<CodeTabs
  tabs={[
    {
      label: "HTML",
      lang: "html",
      code: `<div class="success">
  <svg viewBox="0 0 100 100" class="ring">
    <circle cx="50" cy="50" r="44" fill="none" stroke="#1f8a5b" stroke-width="6" stroke-linecap="round" stroke-dasharray="277" stroke-dashoffset="277" transform="rotate(-90 50 50)"/>
    <path class="check" d="M30 52 L46 66 L72 38" fill="none" stroke="#1f8a5b" stroke-width="7" stroke-linecap="round" stroke-linejoin="round" stroke-dasharray="80" stroke-dashoffset="80"/>
  </svg>
  <div class="text">Payment received</div>
</div>`,
    },
    {
      label: "CSS",
      lang: "css",
      code: `.success {
  display: grid;
  place-items: center;
  gap: 16px;
  animation: settle 0.95s cubic-bezier(0.34, 1.56, 0.64, 1) forwards;
}
.ring circle { animation: trace 0.5s ease-out 0.05s forwards; }
.check { animation: trace 0.3s ease-out 0.4s forwards; }
@keyframes trace { to { stroke-dashoffset: 0; } }
@keyframes settle {
  0% { transform: scale(0.85); opacity: 0; }
  60% { transform: scale(1.06); opacity: 1; }
  100% { transform: scale(1.00); opacity: 1; }
}
.text {
  font: 600 18px ui-sans-serif, system-ui;
  letter-spacing: -0.01em;
  color: #0a0a0a;
}`,
    },
    {
      label: "Live",
      html: `<!doctype html><html><head><style>
body{margin:0;background:#f6f5f1;display:grid;place-items:center;height:100vh;}
.s{display:grid;place-items:center;gap:16px;animation:settle 1.1s cubic-bezier(.34,1.56,.64,1) infinite alternate;}
svg{width:120px;height:120px;}
.r{animation:trace 0.5s ease-out 0.05s infinite alternate;}
.c{animation:trace 0.3s ease-out 0.4s infinite alternate;}
@keyframes trace{from{stroke-dashoffset:80;}to{stroke-dashoffset:0;}}
@keyframes
settle{0%{transform:scale(.85);opacity:0;}60%{transform:scale(1.06);opacity:1;}100%{transform:scale(1);opacity:1;}} .t{font:600 18px ui-sans-serif,system-ui;letter-spacing:-.01em;color:#0a0a0a;} </style></head><body> <div class="s"> <svg viewBox="0 0 100 100"> <circle class="r" cx="50" cy="50" r="44" fill="none" stroke="#1f8a5b" stroke-width="6" stroke-linecap="round" stroke-dasharray="277" stroke-dashoffset="277" transform="rotate(-90 50 50)"><animate attributeName="stroke-dashoffset" from="277" to="0" dur=".5s" begin=".05s" fill="freeze" repeatCount="indefinite"/></circle> <path class="c" d="M30 52 L46 66 L72 38" fill="none" stroke="#1f8a5b" stroke-width="7" stroke-linecap="round" stroke-linejoin="round" stroke-dasharray="80" stroke-dashoffset="80"><animate attributeName="stroke-dashoffset" from="80" to="0" dur=".3s" begin=".4s" fill="freeze" repeatCount="indefinite"/></path> </svg> <div class="t">Payment received</div> </div> </body></html>`, }, ]} caption="Source, CSS, and live preview of the success animation." height={420} /> ## Variants worth keeping Every product needs a success animation, but the variant rarely matters as long as the timing rules are kept. Three useful skins: - **Green ring + check** (above). Default. Reads as "money / commerce." - **Brand-color ring + check**. Same shape, your accent color. Use for "sign-up complete," "form sent." - **Ring + check + 20-particle confetti burst**. Reserve for celebratory states — first purchase, milestone hit. 
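The `stroke-dasharray` values in the source (`277` for the ring, `80` for the check) come from path geometry, and they can be derived rather than guessed. A minimal sketch with hypothetical helper names; the coordinates are the ones used in the SVG above.

```javascript
// Dash length for a circle: its circumference, rounded up to a whole unit.
function circleDashLength(radius) {
  return Math.ceil(2 * Math.PI * radius);
}

// Total length of a polyline given as [x, y] points, such as the checkmark path.
function polylineLength(points) {
  let total = 0;
  for (let i = 1; i < points.length; i++) {
    const dx = points[i][0] - points[i - 1][0];
    const dy = points[i][1] - points[i - 1][1];
    total += Math.hypot(dx, dy);
  }
  return total;
}

const ringDash = circleDashLength(44); // r="44" in the source
const checkLength = polylineLength([[30, 52], [46, 66], [72, 38]]); // M30 52 L46 66 L72 38

console.log(`ring dash: ${ringDash}`);
console.log(`check length: ${checkLength.toFixed(2)}, dash: ${Math.ceil(checkLength)}`);
```

The ring value matches the source. The check path is shorter than the `80` used there, which still hides the stroke at the start but means the mark is fully drawn before the 0.3s animation ends. If the timing ever feels slightly off, setting both dash attributes to the computed length is one thing to try.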
<VariableKnobs html={`<style>body{margin:0;background:{{$BG}};display:grid;place-items:center;height:340px;font-family:ui-sans-serif,system-ui;} .s{display:grid;place-items:center;gap:14px;animation:settle 1.2s cubic-bezier(.34,1.56,.64,1) infinite alternate;} svg{width:110px;height:110px;} @keyframes trace{to{stroke-dashoffset:0;}} @keyframes settle{0%{transform:scale(.85);opacity:0;}60%{transform:scale(1.07);opacity:1;}100%{transform:scale(1);opacity:1;}} .t{font:600 18px ui-sans-serif,system-ui;color:{{$TEXT}};letter-spacing:-.01em;} </style> <div class="s"> <svg viewBox="0 0 100 100"> <circle cx="50" cy="50" r="44" fill="none" stroke="{{$ACCENT}}" stroke-width="6" stroke-linecap="round" stroke-dasharray="277" stroke-dashoffset="277" transform="rotate(-90 50 50)"><animate attributeName="stroke-dashoffset" from="277" to="0" dur=".5s" begin=".05s" fill="freeze" repeatCount="indefinite"/></circle> <path d="M30 52 L46 66 L72 38" fill="none" stroke="{{$ACCENT}}" stroke-width="7" stroke-linecap="round" stroke-linejoin="round" stroke-dasharray="80" stroke-dashoffset="80"><animate attributeName="stroke-dashoffset" from="80" to="0" dur=".3s" begin=".4s" fill="freeze" repeatCount="indefinite"/></path> </svg> <div class="t">{{$LABEL}}</div> </div>`} knobs={[ { name: "LABEL", label: "Label", default: "Payment received" }, { name: "ACCENT", label: "Accent color", type: "color", default: "#1f8a5b" }, { name: "TEXT", label: "Text color", type: "color", default: "#0a0a0a" }, { name: "BG", label: "Background", type: "color", default: "#f6f5f1" } ]} /> ## Rendering to MP4 (for onboarding / launch videos) The animation is pure CSS/SVG, so it lifts directly into the [render pipeline](/tools/html-to-video). Two practical uses: - **Onboarding video** — drop the success state into a 9:16 walkthrough, render once. - **Launch video** — pair the animation with [confetti particles](/blog/css-confetti-particle-effect) and a tagline for a 4-second hero loop. 
Set the loop duration to 1.2s with a 200ms hold at the end, which gives the eye time to read the label before the next loop starts.

## Why this still matters

Many checkout flows in 2026 still get this wrong. The two most common failures: the animation runs too long (looks like buffering), or it has no settle beat (feels abrupt). Both are calibration issues, both are fixable in a single CSS file.

If you ship a payments product, an onboarding flow, or any transactional UI, copy this pattern, calibrate the timing, ship it once. Then [render it to video](/playground) for the help docs, the marketing site, and the in-app tour.

---

# GIF to MP4: why you should, and how to do it right

URL: https://hyperframes.video/blog/gif-to-mp4-the-right-way
Published: 2026-05-06T08:00:00.000Z
Tags: mp4, gif, conversion, performance
Author: kira-tanaka

The most-shared format on the modern web is not, technically, video. GIF is a 1987 image format with a `delay` attribute on each frame. Every loop you have ever seen on Twitter or Slack is a sequence of still frames cycling at a frame rate the format pretends not to have.

This is fine for stickers. For anything longer than two seconds or larger than 480px, GIF is the wrong format. Here is the conversion to MP4, and the parameters that actually matter.

## What GIF gives up

The constraints baked into GIF, in order of how much they hurt:

1. **256 colors per frame.** Every frame is palette-indexed. Gradients band. Skin tones look like illustrations.
2. **No inter-frame compression.** Each frame is encoded mostly independently. A 5-second loop is ~30× the bytes of equivalent H.264.
3. **No alpha.** GIF has 1-bit transparency, which is why "transparent" GIFs have white halos.
4. **No audio.** Obvious. Sometimes forgotten.

MP4 with H.264 fixes three of the four: color, compression, and audio. Alpha is the exception, because H.264 has no transparency channel either; the hard cases below cover the WebM route. The size difference is the easy sell: a 480p, 5-second GIF that is 4MB becomes a 200KB MP4 with no visible loss in quality. The color and gradient improvements are the *real* sell once you see them side-by-side.
## When to convert

Convert if any of these are true:

- The asset is over 1MB as a GIF.
- The asset has gradients, photographs, or skin tones.
- The asset is over 3 seconds long.
- The target platform supports MP4 autoplay (Twitter, LinkedIn, Discord, modern web).

Do not convert if:

- The target is email. Most email clients still strip `<video>` and many do not render H.264.
- The asset is a 200×200 sticker that already weighs 80KB. The savings are not worth the toolchain.

## The parameters that actually matter

The conversion is `ffmpeg -i input.gif output.mp4`. Done. Except that the defaults are wrong for the web. The four flags that change everything:

```bash
ffmpeg -i input.gif \
  -movflags +faststart \
  -pix_fmt yuv420p \
  -vf "scale=trunc(iw/2)*2:trunc(ih/2)*2" \
  -crf 23 \
  output.mp4
```

What each one does:

- `+faststart`: moves the moov atom to the front of the file so the browser starts playback before the whole file downloads. Without this, your "instant" autoplay has a perceptible delay.
- `yuv420p`: universal pixel format. Without it, Safari and many TVs will refuse to decode.
- `scale=trunc(iw/2)*2:trunc(ih/2)*2`: rounds odd dimensions down to the nearest even number. libx264 with `yuv420p` requires even dimensions; without this the encoder refuses to start with `width not divisible by 2`.
- `-crf 23`: quality. 18 is visually lossless, 23 is "very good," 28 is "fine for thumbnails." Default is 23 in most builds, but pin it explicitly so the result is reproducible.

## Size, side-by-side

The comparison most people do not believe until they see it: a 5-second GIF on the left, the equivalent MP4 on the right, plotted by file size.

The MP4 is also *better looking*: no banding on the gradient, no halo on the edges, no palette flicker.

## The hard cases

A few cases that bite:

- **Pixel art.** If the GIF is 8-bit retro sprite work, MP4 will smooth it. Use CRF 18 and disable any pre-scaling.
- **Single-color backgrounds.** If you need true transparency, MP4 cannot help; use WebM with alpha (VP9) instead.
- **Slack.** Slack auto-plays GIFs but requires a hover-tap on MP4. Worth knowing before you migrate your meme library.

## Reverse direction: when MP4 → GIF

Sometimes you need to go the other way (email, retro contexts). The reverse conversion is also short, but the *quality* loss is enormous. Two-pass palette extraction helps:

```bash
# Pass 1: build a palette tuned to this clip
ffmpeg -i in.mp4 -vf "fps=15,scale=480:-1:flags=lanczos,palettegen" palette.png
# Pass 2: encode the GIF using that palette
ffmpeg -i in.mp4 -i palette.png -filter_complex "fps=15,scale=480:-1:flags=lanczos,paletteuse" out.gif
```

If you find yourself doing this often, consider whether the email is the wrong constraint, not the MP4.

## The version-control angle

The honest case for MP4 over GIF for any *generated* video is that MP4 fits the pipeline. If you generate your video from HTML, which is what [HyperFrames](/tools/html-to-video) does, the natural output is MP4. The GIF is a downgrade for distribution; the MP4 is the source of truth.

This is the same pattern as PNG vs JPEG for design exports: keep the lossless, versionable source; export the lossy distribution format on demand. For video, the lossless source is the HTML template; the distribution format is MP4. GIF, if you need it at all, comes last.

See [from DOM to MP4](/blog/from-dom-to-mp4) for the full pipeline, and the [developer integration docs](/developers) if you want to wire conversion into your CI.

## What to do this week

If you have an `assets/` folder full of GIFs over 1MB, this is a Tuesday afternoon project. Convert them. Replace `<img src="X.gif">` with `<video src="X.mp4" autoplay muted loop playsinline></video>`. Watch your page weight drop by an order of magnitude.

The swap is reversible (keep the GIFs around for a week in case anything breaks), but in practice, nothing does.

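
The `<img>` to `<video>` swap above can be scripted. What follows is a minimal sketch in TypeScript, not a HyperFrames feature: `swapGifForVideo` is a hypothetical helper, it assumes an MP4 with the same base name already sits next to each GIF, and it drops any other attributes on the original `<img>`. A regex pass like this is only reasonable for simple, machine-generated markup; use a real HTML parser for anything hand-written.

```typescript
// Hypothetical helper: rewrites <img src="X.gif"> tags into looping <video> tags.
// Assumes X.mp4 exists alongside X.gif. Non-GIF images are left untouched.
function swapGifForVideo(html: string): string {
  // Match an <img> tag whose src value ends in .gif (single or double quotes).
  const gifImg = /<img\b[^>]*?\bsrc=(["'])([^"']+)\.gif\1[^>]*>/gi;
  return html.replace(
    gifImg,
    (_tag: string, _quote: string, base: string) =>
      `<video src="${base}.mp4" autoplay muted loop playsinline></video>`
  );
}

// Example: the GIF becomes a <video>; the PNG is passed through unchanged.
const before = '<p><img src="assets/demo.gif" alt="Demo"></p><img src="logo.png">';
console.log(swapGifForVideo(before));
```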

---

# From DOM to MP4: an annotated render

URL: https://hyperframes.video/blog/from-dom-to-mp4
Published: 2026-05-05T13:00:00.000Z
Tags: engineering, internals, rendering, chromium
Author: kira-tanaka

When you run `hyperframes render hero.html`, somewhere between sixty and seventeen hundred milliseconds later you have an MP4 on disk. I want to spend this post walking through every millisecond in that interval.

The pipeline is not magic. It has six stages, each with specific failure modes, specific optimizations, and specific numbers. After three years of working on it, I can still surprise myself with what is happening between the keystroke and the file.

This post assumes you have used the CLI at least once. If you haven't, run `npx hyperframes init` first; it produces a working composition you can render in a few seconds. The [developer hub](/developers) has the SDK reference and the [docs](/docs) cover the CLI flags referenced below. Then come back.

## Stage 1: Composition resolution (5–40ms)

The first thing the renderer does is read your HTML file and walk it for references. Every `<img src>`, every `<link>` to a font, every `<script src>`. We build a manifest: every asset that will be needed at render time, and where it lives. Local files are resolved to absolute paths. Remote URLs are validated.

This stage exists because of one specific failure: if Chromium opens your file and immediately starts rendering, it will render the first frame before some of those assets have loaded. The composition will look wrong, the first frame will be a flash of unstyled content, and you will be sad. The manifest lets us preheat: fetch and cache everything before the page even opens.

We also compute a content hash here. The hash is the SHA-256 of the resolved HTML plus every asset. It is the fingerprint of this exact composition. If you render twice with the same hash, our cache layer (which we will get to) shortcuts the entire pipeline.
The build is reproducible because the inputs are.

## Stage 2: Chromium boot (300–800ms cold, 0ms warm)

This is the largest single chunk of latency on a cold render, and the one we have spent the most effort eliminating. Booting a fresh headless Chromium takes around half a second. For interactive workflows (preview, agent loops) we keep a warm pool: one or two Chromium processes idling, ready to accept a new tab. With the pool, the boot cost drops to "open a new page in an existing browser," which is around 20-40ms.

The flags we boot Chromium with matter. We disable a bunch of things: GPU rasterization on platforms where it is unstable, background networking, the entire extensions system, telemetry, audio. We enable one thing, synchronous animation timing, which is custom-patched into our Chromium build for reasons I will get to.

We also pin the Chromium version. Every release of HyperFrames is locked to a specific Chromium commit. When you upgrade HyperFrames, you upgrade Chromium. When you don't, you don't. This is the only way bitwise determinism survives across machines: everyone has to be running the same renderer.

## Stage 3: Page load and ready-gate (50–300ms)

The browser opens your HTML. We do not start capturing yet. Instead, we wait on a series of "ready" gates. Each gate has a name, a max timeout, and a specific signal it waits for.

- `dom-ready`: `DOMContentLoaded` fires. The HTML is parsed.
- `fonts-ready`: `document.fonts.ready` resolves. Every font referenced in `font-family` has loaded.
- `images-ready`: every `<img>` we found in stage 1 has fired `load` and its `decode()` promise has resolved.
- `composition-ready`: an optional gate. If the composition defines `window.__hfReady = () => Promise<void>`, we await it. This is the escape hatch for compositions that do their own async setup: fetching JSON, initializing a Three.js scene, loading a Lottie animation.

Only when all gates resolve do we move to capture. This is the single most important detail in the entire pipeline.
Skip this and you get the bug where frame 0 has fallback fonts, the rest of the video has the right fonts, and your viewer sees a one-frame flicker that ruins the whole video. We hit this exactly once, in 2024, and added every guard you see here as a result.

## Stage 4: The seek loop (the main event)

This is where most of the wall-clock time goes, and it is the part of the system that needed the most surgery. Here is what happens, in pseudocode, once per frame:

```ts
for (let i = 0; i < totalFrames; i++) {
  const t = i / fps; // timestamp of frame i, in seconds
  // Seek: publish the time to CSS (custom property) and to JS (event).
  await page.evaluate((time) => {
    document.documentElement.style.setProperty("--hf-time", `${time}s`);
    window.dispatchEvent(new CustomEvent("hf-seek", { detail: { time } }));
  }, t);
  // Wait one animation frame so the DOM has settled before capture.
  await page.evaluate(() => new Promise<void>((r) => requestAnimationFrame(() => r())));
  const buffer = await page.screenshot({ type: "png", omitBackground: false });
  encoder.write(buffer);
}
```

A few things are doing a lot of work in that short loop.

The `--hf-time` CSS variable cascades into every CSS animation in the composition. Animations are paused (`animation-play-state: paused`) and their effective progress is computed from the variable via `animation-delay: calc(var(--hf-time) * -1)`. The browser's keyframe interpolator runs as normal; we just lie to it about what time it is.

The `hf-seek` event is the escape hatch for JavaScript-driven animations. Authors who want to drive Three.js, Canvas, GSAP, or anything else listen for this event and update their state from `event.detail.time`. The event is synchronous from the renderer's perspective: we wait for the next `requestAnimationFrame` to confirm the DOM has settled.

The screenshot itself is the slow part. Even on warm Chromium, taking a 1920×1080 PNG screenshot is around 8-15ms. Across 300 frames at 60fps (a 5-second video), that is 2.4–4.5 seconds of pure capture time. We have explored faster paths: CDP's `Page.captureScreenshot` with `captureBeyondViewport: false` is fastest, and we use it.
We've also experimented with a raw framebuffer extraction that skips PNG encoding entirely, piping uncompressed pixels to ffmpeg's stdin. It is around 30% faster but more fragile. We ship it behind `--unsafe-raw-pipe` for users who want the speed.

### What synchronous animation means

The single biggest determinism win came from patching Chromium's animation scheduler. Normally, when you change `animation-delay`, the browser updates the visual at the next compositor tick, which is asynchronous with respect to JavaScript. That means the screenshot we take immediately after the seek might capture the animation at its old position, not its new one.

Our patch adds a single function to the CDP protocol: `Animation.flushSync`. It forces the animation host to recompute every animated property immediately, on the main thread, before returning. We call it after every seek event. The cost is minor (200µs); the correctness gain is total. We have submitted this patch upstream; reception has been polite but slow.

## Stage 5: Encoding (parallel with capture)

We do not wait for every frame to be captured before starting to encode. ffmpeg runs in a separate process, reading PNGs from a Unix pipe or shared memory. As soon as frame 0 arrives, encoding begins. By the time frame N-1 is captured, frame 0 is already in the muxer.

The encoder choice matters. We default to `libx264 -preset medium -crf 18` for general use; the resulting H.264 is universally playable and the quality is high. For users who want smaller files, `-preset slow -crf 20` shaves 30% off the file size at a 2x encode cost. For users who need AV1 we shell out to `libaom-av1`; it is much slower but the bitrate-to-quality curve is dramatically better. For users who need lossless intermediates, we offer `prores_ks` (Apple ProRes) and FFV1.

What we do *not* do is re-encode after the fact. The captures are written to the encoder directly; the encoder is the final stage.
There is no temporary "raw frames on disk" step, because that would be wasteful and slow.

## Stage 6: Muxing and finalization (20–80ms)

The encoder writes a stream of compressed video data. ffmpeg muxes that into an MP4 container, adding the `moov` atom (which describes the video's structure), inserting any audio tracks, and writing metadata. The final file lands on disk.

We also write a sidecar JSON: a manifest of what was rendered. Composition hash, engine version, Chromium version, ffmpeg version, encoder settings, total duration, total frames, render wall time. This sidecar is invaluable for debugging: when a customer says "this MP4 looks wrong," the first thing we ask for is the manifest. The manifest tells us exactly which pipeline produced it.

## What can go wrong, in priority order

After three years, the failure modes have a clear long tail. Here are the top ones, with frequencies from our error telemetry.

1. **Font load timeout** (4.1% of renders). User referenced a font that the network is slow to deliver. Fix: bundle the font locally.
2. **Composition timeout in `__hfReady`** (1.2%). User's async setup never resolved. Usually a fetch that hangs. Fix: add a `Promise.race` with a timeout.
3. **Image 404** (0.8%). User referenced a path that doesn't exist. Fix: lint catches this before render.
4. **Out-of-memory in Chromium** (0.3%). User created a 12-layer-deep filter graph that the rasterizer cannot fit in 4GB. Fix: simplify or render at lower resolution.
5. **Encoder crash** (<0.1%). ffmpeg got something it could not handle. Usually a 16k-wide canvas. Fix: raise a sensible error before invoking the encoder.

We work hard to make every one of these fail at *lint* time, not at *render* time. The lint pass catches asset references, font references, suspicious DOM sizes, and missing duration metadata. By the time you run render, the only things that can fail are network and out-of-memory, and even those we trap with clear messages.
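
The `Promise.race` fix for the `__hfReady` timeout can be sketched as follows. This is an illustration, not the renderer's own code: `withTimeout` and `loadSceneData` are hypothetical names, and the 5-second budget is an arbitrary assumption you would tune to your composition.

```typescript
// Hypothetical helper: rejects if `work` has not settled within `ms` milliseconds.
function withTimeout(work: Promise<void>, ms: number, label: string): Promise<void> {
  let timer: ReturnType<typeof setTimeout> | undefined;
  const timeout = new Promise<never>((_resolve, reject) => {
    timer = setTimeout(() => reject(new Error(`${label} timed out after ${ms}ms`)), ms);
  });
  // Whichever promise settles first wins; clear the timer either way
  // so it cannot fire after the race is over.
  return Promise.race([work, timeout]).then(
    () => { clearTimeout(timer); },
    (err) => { clearTimeout(timer); throw err; }
  );
}

// Usage inside a composition (browser only). `loadSceneData` stands in for
// whatever async setup the composition performs:
// window.__hfReady = () => withTimeout(loadSceneData(), 5000, "scene data");
```

With a wrapper like this, a hung fetch should surface as a named error from the composition's own setup code, with a label that says which step stalled.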
## What the pipeline looks like in numbers

A representative render: 5 seconds at 1920×1080, 60fps. 300 frames. Warm Chromium.

| Stage | Time | Notes |
|---|---|---|
| Composition resolution | 12ms | Asset walk, hash, manifest |
| Chromium boot | 0ms | Warm pool |
| Page load + ready gates | 180ms | Most of this is fonts |
| Seek loop | 1380ms | ~4.6ms per frame, parallel with encode |
| Encoder finalize | 90ms | Mux, moov, sidecar |
| **Total** | **1662ms** | |

Cold Chromium adds 500-700ms. A 30-second video at the same resolution lands around 9 seconds wall time. A 4K render at 60fps is roughly 4x slower per frame. These numbers are honest, measured on a laptop CPU, with no GPU acceleration of the rasterizer.

The takeaway: from your keystroke to your MP4 on disk, every stage is doing specific, measurable work. None of it is magic. All of it is now boring infrastructure, which is exactly what we wanted. If you want a contrast with a different architecture, the [Remotion comparison](/compare/remotion) walks through the same six stages on their pipeline.

---

# Why deterministic video rendering matters in CI

URL: https://hyperframes.video/blog/deterministic-video-rendering-ci
Published: 2026-05-05T11:00:00.000Z
Tags: determinism, ci, render, engineering
Author: kira-tanaka

The single most underrated property of a build system is *determinism*: same input, same output, every time. Software engineering took twenty years to internalize this: reproducible builds, lockfiles, content-addressed caches.

Video production has not internalized it yet. Most rendering pipelines are non-deterministic by accident, and most teams do not notice until something breaks.

Here is why determinism matters for video, what breaks without it, and how to test that you actually have it.
## What "deterministic" means for video

A render is deterministic if, given the same source HTML, the same renderer version, and the same render parameters (resolution, duration, fps), the output MP4 has the same SHA-256 hash on every run, on every machine, forever.

That is a very strict definition. Most pipelines fail at the first step: the same source produces different bytes on different runs because of:

- Wall-clock-driven animation (`Date.now()`, `performance.now()`)
- Random number seeds initialized from the clock
- Frame timing variance (`requestAnimationFrame` does not fire at the same offset every run)
- Font loading races
- Variable bitrate encoding

Each one is a small bug; together they make the output a different file every render.

## Why it matters

Three concrete wins from determinism:

### 1. CI caching

If `render(input) → output_hash`, your CI can skip the render step when the input has not changed. For a marketing team rendering 1,000 variants per launch, this is the difference between a 4-minute pipeline and a 40-minute one.

### 2. Diff review

When a designer changes a template, you can see *exactly* what changed. Render the old and the new; diff the two MP4s frame by frame. If the diff is "nothing for the first 3 seconds, then a font weight change at 3.2s," you know what to review.

This is impossible if the renders are non-deterministic. The frame-by-frame diff is noise from clock jitter; the real change is invisible.

### 3. Trust

The hardest one to measure but the most important. A team that ships videos from CI without re-watching every output is a team that trusts the pipeline. Determinism is the foundation of that trust. A team that has been bitten once by "the same render produced a different video in production" never trusts the pipeline again.
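
The CI caching win depends on a stable key for each render. Here is a minimal sketch of such a key, assuming Node's built-in `crypto` module. `renderCacheKey`, the `RenderParams` shape, and the version string in the example are hypothetical, not part of the HyperFrames API. The inputs mirror the definition above: source HTML, renderer version, and render parameters.

```typescript
import { createHash } from "node:crypto";

// Hypothetical shape: the render parameters named in the definition of determinism.
interface RenderParams {
  width: number;
  height: number;
  fps: number;
  durationSeconds: number;
}

// Hypothetical helper: derives a cache key from everything that can change the output bytes.
function renderCacheKey(sourceHtml: string, rendererVersion: string, params: RenderParams): string {
  const hash = createHash("sha256");
  // Serialize params in a fixed field order so the key never depends on object key order.
  const canonicalParams = [params.width, params.height, params.fps, params.durationSeconds].join("x");
  // A NUL separator after each field keeps adjacent fields from running together.
  for (const part of [rendererVersion, canonicalParams, sourceHtml]) {
    hash.update(part, "utf8");
    hash.update("\0");
  }
  return hash.digest("hex");
}

// Example inputs are made up; the key is a 64-character hex string.
const key = renderCacheKey("<h1>Hello</h1>", "1.4.0", { width: 1920, height: 1080, fps: 60, durationSeconds: 5 });
console.log(key);
```

A CI step could then skip the render whenever an artifact is already stored under that key. In a real pipeline the hashed source would also need to cover every referenced asset, not just the HTML string.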
## How to test for it

A real CI test:

```bash
#!/bin/bash
hash1=$(hyperframes render template.html | sha256sum | cut -d' ' -f1)
hash2=$(hyperframes render template.html | sha256sum | cut -d' ' -f1)
if [ "$hash1" != "$hash2" ]; then
  echo "Non-deterministic render: $hash1 != $hash2"
  exit 1
fi
```

Run this in [GitHub Actions](/integrations/github-actions) on every PR that touches a template. If the test fails, the template introduced a non-determinism: wall-clock, randomness, or a font race. Fix it before it ships.

For longer renders, do not compare the whole MP4; compare frame hashes. Two MP4s can differ in container metadata (timestamps, encoder version string) while being identical pixel-by-pixel. Extract frames with ffmpeg, hash each, compare.

## The four sources of non-determinism

What to look for when a render *is* drifting:

### Wall-clock

```js
// BAD
setInterval(() => updatePosition(), 16);

// GOOD
addEventListener('hf-seek', e => render(e.detail.time));
```

The render contract should be `render(t)` where `t` is *given*, not measured. The HyperFrames runtime drives `t`; the template never reads the clock.

### Random seeds

```js
// BAD
const r = Math.random();

// GOOD
let seed = 1;
function rand() {
  seed = (seed * 9301 + 49297) % 233280;
  return seed / 233280;
}
```

Initialize a deterministic PRNG with a fixed seed. The output is reproducibly random: same seed, same sequence.

### Font loading

```html
<script>
  document.fonts.ready.then(() => window.__hf_ready__ = true);
</script>
```

The renderer respects `window.__hf_ready__` and waits for it before capturing frames.

### Encoder settings

Pin the encoder version, codec parameters, and bitrate mode. Constant-rate-factor (CRF) over variable-bitrate; the same codec library version across runs. The renderer pins these; if you call ffmpeg yourself, pin them in your `Dockerfile`.

## The flake budget

A render pipeline that is "deterministic 99% of the time" is *non-deterministic*.
The 1% is a flake budget you cannot afford if you want CI to cache and diff. Either the pipeline is byte-stable, or it isn't.

The honest test: render the same source 100 times in CI. If 100 hashes are identical, you are deterministic. If even one differs, find the source and fix it. There is no middle ground.

## The downstream features

What determinism unlocks, once you have it:

- **Content-addressed video storage.** Hash the source HTML; the hash is the cache key. Re-renders deduplicate automatically.
- **Visual regression testing.** Snapshot the rendered MP4; fail the test if any frame changes. Works the same way as screenshot testing for UI.
- **Rollback.** If a render in production looks wrong, you have a hash; pull the source HTML at that hash; reproduce locally.

None of these are possible without byte-stable output. All of them are how teams that ship video at scale stay sane.

For more on the philosophy here, see [the deterministic video manifesto](/blog/deterministic-video-manifesto). For the implementation, see the [developers overview](/developers).

The TL;DR: determinism is not a performance optimization. It is a correctness property. Pipelines that have it are operating in a different category from pipelines that do not. If you ship more than one video a quarter from code, this is the property to insist on.

---

# 10 CSS progress bars worth copying (with full source)

URL: https://hyperframes.video/blog/css-progress-bar-collection
Published: 2026-05-04T09:00:00.000Z
Tags: css, progress-bar, tutorial, ui
Author: ren-park

The progress bar is the most-implemented and least-thought-about UI element. Every product ships at least three: file uploads, multi-step forms, video scrubbers. Most of them look the same because most engineers reach for the default `<progress>` element and stop.

Here are ten progress bars worth copying, each with the full source.
They're CSS-only, deterministic, and render straight to MP4 if you need them in a loading-screen onboarding video. ## 1. The simple fill The baseline. A track, a fill, a width transition. Add a 200ms ease-out and it's already better than 80% of what ships. ```html <div class="bar"><div class="fill" style="width:62%"></div></div> ``` ```css .bar { height: 6px; background: #1a1a1a; border-radius: 999px; overflow: hidden; } .fill { height: 100%; background: #ff3b1f; transition: width .2s ease-out; } ``` ## 2. The animated stripes (determinate) Classic loading-bar feel: a striped fill that scrolls slowly. Use a background-image gradient with 45° stripes. <InlineSandbox html={`<!doctype html><html><head><style> body{margin:0;background:#0a0a0a;display:grid;place-items:center;height:100vh;font-family:ui-sans-serif,system-ui;color:#fff;} .row{width:480px;display:grid;gap:20px;} .label{font:600 11px ui-monospace,monospace;letter-spacing:.2em;text-transform:uppercase;color:rgba(255,255,255,.55);} .bar{height:10px;background:#1a1a1a;border-radius:999px;overflow:hidden;} .fill{height:100%;width:62%;border-radius:999px;background:linear-gradient(45deg,#ff3b1f 25%,#ff6a4a 25%,#ff6a4a 50%,#ff3b1f 50%,#ff3b1f 75%,#ff6a4a 75%);background-size:20px 20px;animation:slide 1s linear infinite;} @keyframes slide { to { background-position: 40px 0; } } </style></head><body> <div class="row"> <div><div class="label">Uploading · 62%</div><div class="bar"><div class="fill"></div></div></div> </div> </body></html>`} height={140} caption="Striped fill — the stripes scroll while the width stays put." /> ## 3. The indeterminate loop For "I don't know how long this will take." A small bright segment slides across the full track on a 1.5s loop. ```css .fill { position: absolute; height: 100%; width: 30%; background: #ff3b1f; border-radius: 999px; animation: indet 1.5s cubic-bezier(.4,0,.2,1) infinite; } @keyframes indet { 0% { left: -30%; } 100% { left: 100%; } } ``` ## 4. 
The gradient sweep A fill with an animated color gradient — implies activity even when the width isn't changing. <InlineSandbox html={`<!doctype html><html><head><style> body{margin:0;background:#0a0a0a;display:grid;place-items:center;height:100vh;font-family:ui-sans-serif,system-ui;color:#fff;} .bar{width:480px;height:12px;background:#1a1a1a;border-radius:999px;overflow:hidden;} .fill{height:100%;width:62%;background:linear-gradient(90deg,#ff3b1f,#ff9b00,#ff3b1f);background-size:200% 100%;animation:sweep 2s linear infinite;border-radius:999px;} @keyframes sweep { to { background-position: -200% 0; } } </style></head><body><div class="bar"><div class="fill"></div></div></body></html>`} height={120} caption="Gradient sweeps left while the width stays constant." /> ## 5. The segmented bar For multi-step flows. Five (or N) discrete pills, the completed ones filled. ```html <div class="seg" data-step="3"> <span></span><span></span><span></span><span></span><span></span> </div> ``` ```css .seg { display: grid; grid-template-columns: repeat(5, 1fr); gap: 6px; } .seg span { height: 4px; background: #1a1a1a; border-radius: 999px; } .seg span:nth-child(-n+3) { background: #ff3b1f; } ``` ## 6. The dual-track (upload + processing) A bar showing two pipelines at once: the lower track is fully filled (upload done), the upper is partial (processing in progress). Useful for "uploaded, now transcoding" flows. 
<InlineSandbox html={`<!doctype html><html><head><style> body{margin:0;background:#0a0a0a;display:grid;place-items:center;height:100vh;font-family:ui-sans-serif,system-ui;color:#fff;} .wrap{width:480px;} .bar{height:14px;background:#1a1a1a;border-radius:999px;overflow:hidden;position:relative;} .t1{position:absolute;inset:0 38% 0 0;background:#ff3b1f;border-radius:999px;} .t2{position:absolute;inset:0 0 0 0;background:linear-gradient(90deg,transparent 62%,rgba(255,255,255,.15) 62%);background-size:20px 14px;animation:tick 1s linear infinite;} @keyframes tick { to { background-position: 40px 0; } } .lg{display:flex;gap:14px;margin-top:10px;font-size:11px;color:rgba(255,255,255,.7);} .lg b{font-weight:700;color:#fff;} .dot{width:8px;height:8px;border-radius:50%;display:inline-block;background:#ff3b1f;margin-right:6px;vertical-align:middle;} .dot.q{background:rgba(255,255,255,.3);} </style></head><body> <div class="wrap"><div class="bar"><div class="t1"></div><div class="t2"></div></div> <div class="lg"><span><i class="dot"></i><b>Uploaded</b> 62%</span><span><i class="dot q"></i><b>Transcoding</b> queued</span></div></div> </body></html>`} height={160} caption="Dual-track: upload done, transcode pending." /> ## 7. The radial / circular bar When the bar should fit inside a card, not stretch across one. SVG circle with `stroke-dashoffset`. See [animated pie chart](/blog/animated-pie-chart-css) for the same primitive applied to ratio charts. ## 8. The wave bar For audio uploads. A row of vertical bars, each with a randomized phase, swaying. Five lines of CSS keyframes. The "audio waveform that means nothing" trope, but useful where you genuinely don't have a real waveform yet. ## 9. The number-mirror bar A bar with the percentage value mirrored in big type underneath. The two animate together. For dashboard "headline number" hero shots, this is the move. 
```html <div class="hero"> <div class="bar"><div class="fill" style="width:62%"></div></div> <div class="num">62<span>%</span></div> </div> ``` The number uses `tabular-nums` and counts up alongside the width. Pair it with [an animated counter](/blog/animated-number-counter-html) for the count logic. ## 10. The "step ticker" bar Combines the segmented bar with a label per step. Used in checkout flows, onboarding wizards, anywhere the user needs to know "step 3 of 5" without thinking. <VariableKnobs html={`<style>body{margin:0;background:{{$BG}};color:#fff;display:grid;place-items:center;height:280px;font-family:ui-sans-serif,system-ui;} .wrap{width:480px;} .steps{display:grid;grid-template-columns:repeat(5,1fr);gap:8px;} .step{display:grid;gap:6px;} .step .bar{height:4px;background:rgba(255,255,255,.12);border-radius:999px;} .step.on .bar{background:{{$ACCENT}};} .step.on.curr .bar{background:linear-gradient(90deg,{{$ACCENT}} 60%,rgba(255,255,255,.12) 60%);} .step .lbl{font:600 10px ui-monospace,monospace;letter-spacing:.18em;text-transform:uppercase;color:rgba(255,255,255,.55);} .step.on .lbl{color:#fff;} .title{font-weight:700;font-size:22px;letter-spacing:-.01em;margin-bottom:14px;} </style> <div class="wrap"> <div class="title">{{$TITLE}}</div> <div class="steps"> <div class="step on"><div class="bar"></div><div class="lbl">Account</div></div> <div class="step on"><div class="bar"></div><div class="lbl">Plan</div></div> <div class="step on curr"><div class="bar"></div><div class="lbl">Payment</div></div> <div class="step"><div class="bar"></div><div class="lbl">Confirm</div></div> <div class="step"><div class="bar"></div><div class="lbl">Done</div></div> </div> </div>`} knobs={[ { name: "TITLE", label: "Title", default: "Step 3 of 5 — Payment" }, { name: "ACCENT", label: "Accent", type: "color", default: "#ff3b1f" }, { name: "BG", label: "Background", type: "color", default: "#0a0a0a" } ]} /> ## Calibration rules Three rules that apply across all ten: 1. 
**Determinate bars use width transitions, not animations.** A `transition: width 0.2s` snaps cleanly to a new value; a keyframe animation will fight you when the value changes mid-animation. 2. **Indeterminate bars use animations, never transitions.** Animation runs deterministically; you don't want the indeterminate state to interpolate when the loop restarts. 3. **Color = state, width = progress.** Don't mix. A bar that changes from blue to green at 100% is two pieces of information competing for the same channel. ## Rendering to video All ten render straight to MP4 through the [render pipeline](/tools/html-to-video) since they're pure CSS. The common use case: a 4-second loop for "your file is processing" baked into a help video. The indeterminate bar (#3) is the right pick — it loops cleanly and reads as "active" without claiming a specific progress value. [Open the playground](/playground), pick a bar, drop it into a card, render the loop. --- # The agent's camera URL: https://hyperframes.video/blog/the-agents-camera Published: 2026-05-03T16:00:00.000Z Tags: agents, llm, design, workflow Author: ren-park The first time I gave Claude a HyperFrames composition and asked it to "make the second title card feel more confident," it took twelve seconds. The title moved up 4 pixels, the easing curve changed from `ease-out` to `cubic-bezier(.2,.7,.1,1)`, and the font weight went from 500 to 600. The video, when I rendered it, was visibly better. I had not asked for any of those specific changes. I had asked it to make a video feel more confident, and it had translated that into precise edits to a file. I have been a filmmaker. I have been a developer advocate at a CI company. I have spent the last year watching language models slowly become very, very good at writing motion design, and I want to tell you what I have learned about how to design tools for them. Because the tools matter enormously. Most video pipelines, handed to an LLM, produce slop. 
HyperFrames, handed to the same LLM, produces work that ships. The difference is not the model. The difference is what we expose to it. ## The author is no longer human-shaped For thirty years, every video tool has been designed for a human sitting at a desk with a mouse. The interface is timelines and bezier curves and modal dialogs and undo stacks. The mental model is *direct manipulation*: you grab a thing and drag it. The author and the tool are in continuous physical conversation. An LLM does not have hands. It cannot drag a bezier handle. It cannot watch a preview and decide to nudge a keyframe by two pixels. It can, however, read a file, understand the structure, and write a different file. The natural interface for an LLM is the source code of the composition itself. Plain text. Versionable. Diffable. Greppable. This is why HyperFrames is HTML. We could have invented a new format. Many tools have. But every new format is a tax on every model — the model has to learn the schema, learn the gotchas, learn what is legal and what is not. HTML is free. The model already knows it. It has read a billion examples. When we expose a composition as HTML, the model arrives pre-trained. The [OpenAI integration](/integrations/openai) wires this up as a single function-calling tool you can drop into an existing agent. ## What an LLM needs from a video tool I have spent a lot of time watching agents author videos. The pattern of what they need is consistent across models — GPT, Claude, Gemini — and it is different from what a human needs. **A single file as the substrate.** Humans tolerate project files with seventeen scattered assets. Agents need everything in one place, or they hallucinate paths. Our compositions are single HTML files with inline styles and inline scripts. Assets are referenced by URL or base64. The whole video fits in a context window. **Deterministic render.** I have written about this elsewhere on this blog, but it matters specifically for agents. 
An LLM in an iteration loop needs the feedback to be stable. If the render is noisy, the gradient is garbage. We described why earlier in [A deterministic video manifesto](/blog/deterministic-video-manifesto). **A linter, not a debugger.** Humans debug by stepping through. Agents debug by *re-reading the error*. The richer the error, the better the next attempt. `hyperframes lint` produces structured errors: line numbers, expected versus actual, suggested fixes. Every error message ends with the question, "what would an LLM do with this?" If the answer is "be confused," we rewrite the message. **A preview that the agent can see.** We expose `hyperframes preview --json` which outputs a structured description of the composition at frame intervals — text, positions, colors, durations. An agent can sample-render at 0s, 1s, 2s and read the JSON to verify what it built without having to look at pixels. (When pixels are needed, we render to PNG and pipe it back through a multimodal model. But the JSON path is the fast loop.) **Sub-composition reuse.** Agents are excellent at composition. If you give them a `<LowerThird>` and a `<KPICard>` and a `<TitleCard>`, they will reach for them. If you make them write motion graphics primitives from scratch every time, they will, but the output will be worse and the iteration will be slower. We ship a registry of these primitives at `hyperframes add`. The agent imports them. ## The shape of the agent loop When an agent authors a video in HyperFrames, the loop looks like this, every time: 1. Read the brief. 2. Generate a draft composition HTML. 3. Run `hyperframes lint`. Read the errors. Fix. 4. Run `hyperframes preview --frame 1500 --out probe.png`. Look at the still. 5. Render the full thing. Open the MP4 (or, more usefully, watch the timeline JSON). 6. Compare against the brief. Identify the largest delta. 7. Edit one or two things. Go to 3. What is striking about this loop is how *much like a human's loop* it is. 
A motion designer at a desk does the same thing: draft, preview, lint mentally, render a probe, compare. The reason the agent can do this is that we have exposed all the same tools the human uses, but in a form the agent can drive. Lint runs in the terminal. Preview produces a file. Render produces a file. The agent can run all of these with Bash. There is no GUI to click. If you have used Cursor or Claude Code to write a TypeScript project, you have used this pattern. Agents are excellent at developer loops — write, lint, run, read output, fix. The bet HyperFrames made early was: make the video loop look exactly like a developer loop. Then every existing agent already knows how to drive it. ## What agents are bad at I want to be honest about the failure modes, because they are real. Agents are bad at *taste*. They will produce compositions that are technically correct, on-brief, well-eased, and somehow lifeless. The difference between a good motion designer's work and an agent's work is rarely a specific keyframe — it is the gestalt of fifty small choices, every one of which the agent had no preference about. We address this by shipping opinionated defaults. Our default eases are not browser-default; they are tuned. Our default type scale is not Tailwind-default; it is a typographer's. The agent, lacking taste, inherits ours. Agents are bad at *long-form pacing*. A 5-second composition? Fine. A 60-second composition with three acts and a turn at 0:38? Hard. The model loses the thread of the larger structure when every individual frame is a local optimization. We are experimenting with composition outlines: the agent first writes a YAML file that describes the beats, then fills in HTML for each beat. Early results are encouraging but not shippable. Agents are bad at *brand consistency across many compositions*. If you have a brand system with fifteen rules, the agent will follow ten of them in any given composition, but a different ten each time.
We address this with a `brand.css` that the agent imports verbatim — colors, type, spacing — so the rules cannot drift. ## Why the agent's camera is HTML, specifically Could we have built this on Remotion? Manim? A custom DSL? Yes to all three, and I have shipped on all three. Here is why HTML wins for the agent case in particular. Remotion is React, which is JavaScript, which is fine — but the agent has to learn the timeline conventions, the composition API, the frame counter. None of these are universally known. Every Remotion project I have seen Claude write has at least one off-by-one in the frame math, because the model is reasoning about a custom timeline rather than the CSS clock it already knows. (We get into the full architectural delta in the [Remotion comparison](/compare/remotion).) Manim is Python and beautiful for technical animation. But Manim is also a deep API, and the agent has to remember which `Animation` subclass to use for which effect. When it forgets, the output is wrong in a way that does not lint. The error mode is silent. A custom DSL is the cleanest in theory and the worst in practice. Every DSL is a tax. The first thing the agent does with a DSL is mistranslate the brief into it. HTML has no translation step. The agent is writing in the substrate. This is also why we chose plain CSS keyframes over a JS animation library. Frameworks like GSAP and Anime are wonderful for humans because they hide complexity. They are bad for agents because they hide complexity. The agent cannot reason about an effect it cannot see in the source. We support GSAP for advanced use cases, but our defaults are vanilla CSS for a reason. ## The next year of agentic video Two things will happen quickly. First, the latency from "describe a video" to "ship a video" will fall below thirty seconds for a meaningful range of briefs. We are already there for short title cards, lower thirds, social ads. 
The full minute-long product explainer is six months out, maybe twelve. The bottleneck is not the model; it is the toolchain. Second, the unit of authorship will shift from "one video" to "ten thousand variants." When an agent can ship a video in thirty seconds, you do not ship one — you ship a campaign. Different copy, different lengths, different aspect ratios, different markets. We will write more about this in [Render 10k variants overnight](/blog/render-10k-variants-overnight). The short version: the agent is the camera, and the camera is now cheap enough to point a thousand times. If you are building anything that touches video — marketing, education, social, internal training, regulated communications — your team is about to grow by one. The new teammate works fast, takes direction in English, and never tires. The tool you give it is the multiplier. Make sure the tool was built with this teammate in mind. We built HyperFrames for that teammate. Open a terminal. Run `npx hyperframes init`, or pair-program with the model directly in the [HyperFrames playground](/playground). Ask Claude to make you something. Then come back and tell us what you saw. --- # A deterministic video manifesto URL: https://hyperframes.video/blog/deterministic-video-manifesto Published: 2026-05-02T14:00:00.000Z Tags: determinism, rendering, engineering, ci Author: kira-tanaka Here is a test I run on every video tool I am asked to take seriously. I render the same composition twice on the same machine. I open both MP4s in a hex editor. I diff the bytes. If the bytes differ, I close the laptop. The tool is, in the strict sense, broken. It might still be useful. It might still be the right choice for somebody. But it is not a render pipeline; it is a slot machine with very nice graphics. For the last year I have been building the render core at HyperFrames. 
We have shipped a lot of features in that time — preview, lint, batch, the agent SDK — and exactly one of them is the feature that matters. Every other feature is downstream of determinism. So I want to spend some time explaining what determinism actually means in this context, why nearly every existing video tool fails the byte-diff test, and what it took to pass it. ## Determinism is a precondition, not a property When most people say "deterministic," they mean "I get a sensible-looking output most of the time." That is not what the word means in compiler theory, in cryptography, in build systems, or here. Deterministic means: given the same inputs, the same output, every time, on every machine that runs the same engine version. Bit for bit. No wiggle, no jitter, no "close enough." The reason I am pedantic about this is that *every interesting downstream property of a render system is impossible without it*. You cannot cache renders. You cannot prove a regression. You cannot ship a CI pipeline that fails on visual diff (which is exactly what we wire up in the [GitHub Actions integration](/integrations/github-actions)). You cannot run a render farm. You cannot let an agent iterate on a video and trust that the second render reflects only the changes it made. You cannot show a customer two variants and tell them which one is which. The same logic applies in software. Reproducible builds were not invented because Debian maintainers had a philosophical preference. They were invented because once you have them, supply-chain attacks become tractable, build caches become safe, and binaries become diffable. The pre-reproducible world is not "worse." It is *qualitatively different* — different problems are tractable in each. ## What breaks determinism in browsers The browser was not designed to be a deterministic frame source. It was designed to render pages quickly under variable load, on variable hardware, with variable network conditions. 
Every part of that sentence is the opposite of what a render farm needs. The specific failure modes are worth cataloging, because each one demands a specific fix. **Animation jitter.** A CSS `@keyframes` animation, played back live, advances by however much time the browser thinks has elapsed between frames. On a fast machine, that is ~16.7ms; on a slow one, more; under contention, anything. The composition at "two seconds in" depends on whether the browser dropped frames before that point. You cannot render against this. The fix: pause every animation (`animation-play-state: paused`) and drive timing externally. We dispatch an `hf-seek` custom event with `{ time: t }` and let CSS variables propagate that to `animation-delay`. The keyframe interpolator still runs; we just lie to it about the clock. **JavaScript clocks.** `Date.now()`, `performance.now()`, `Math.random()`, `requestAnimationFrame` callbacks — all of these vary across runs. Every one is a determinism hazard. We patch them inside the render context. `Date.now` returns a function of the seek time. `Math.random` is a seeded PRNG. `requestAnimationFrame` runs synchronously when called during a seek. **Font loading.** Fonts arrive over the network. If frame 0 captures before the font is ready, frame 0 has fallback glyphs. We await `document.fonts.ready` before the first capture, and we hash the resolved font URLs into the build fingerprint. **GPU compositing.** Hardware-accelerated layers introduce subpixel variance across drivers. For most compositions this does not matter — the variance is below the perceptual threshold and well below the H.264 quantizer. For pipelines that need byte-identity (regulated industries, forensic tools), we expose `--gpu off` and accept the speed hit. **Image decode order.** Two images decoded in the same frame can resolve in either order. We block on `decode()` for every image referenced before the first seek. ## The bytewise audit How do you actually verify all of this? 
You build an audit. Ours runs nightly. It takes a hundred compositions from our regression suite, renders each one twice in fresh Chromium instances, computes SHA-256 of every output frame as PNG, and diffs. A single mismatched hash fails the build. When we started, we had two compositions that passed and ninety-eight that failed. A year later, ninety-eight pass and two are still flaky — both involve real WebGL with float texture rounding, and we ship them with a warning. Every category of failure we hit had a specific cause: - Animation timing race conditions (fixed by event-driven seek) - Font load races (fixed by `fonts.ready` await + URL hashing) - Math.random in keyframe positions (fixed by seeded PRNG) - Image decode ordering (fixed by serial `decode()` awaits) - A surprising case where Chromium's text rasterizer used different subpixel positioning if the previous frame had hovered the cursor in a different position — we now reset cursor position before every seek Each fix is small. The discipline is in *catching* each one. The audit is the discipline made automatic. ## Determinism enables agents I want to spend a moment on why this matters for AI agents specifically, because it is the most-asked question we get. When an agent generates a video composition, it does so in a loop. It writes HTML, renders, looks at the output, adjusts. Maybe ten iterations, maybe fifty. If the render is non-deterministic — if rendering the same HTML produces a different frame each time — the agent has no signal. It cannot tell whether the change it just made improved the output or whether the renderer was simply in a different mood. The feedback loop collapses. The agent flails. We have measured this. On our internal benchmark of "agent gets a brief, produces a finished video," the success rate on a non-deterministic renderer (we tested two open-source competitors, names withheld) is around 23%. 
On HyperFrames it is 71% — you can poke at the same surface agents use in the [interactive playground](/playground). Same agent, same prompts, same model. The only difference is whether the renderer gives the agent a stable signal. This is the same observation Karpathy made about compilers years ago: you cannot do gradient descent through a noisy loss function. Determinism is the loss function being clean. Without it, the gradient is garbage and the agent never converges. ## What determinism costs It would be dishonest to pretend this is free. Determinism imposes a cost, and it is worth being clear about what that cost is. The first cost is that some browser features become off-limits, or require careful handling. `setTimeout` outside the seek loop is forbidden. `Math.random` without a seed is forbidden. Reading the actual wall clock for animation is forbidden. The composition has to be a *function of t*, not a process that evolves over time. Most authors find this clarifying once they internalize it, but it is a real constraint. The second cost is render speed. A deterministic renderer cannot use the same shortcuts a live browser does. We cannot rely on background decode, asynchronous layout, or speculative rasterization. Each frame is a synchronous round-trip from seek to capture. This means we render slower than a browser that does not care about correctness — typically 30-50% slower per frame on the same hardware. We are okay with that trade because the alternative is "fast renders of the wrong frame." The third cost is engineering effort. Maintaining the patched JavaScript environment, the seeded PRNGs, the font-load guards, the GPU pinning, the audit — this is all infrastructure work that does not directly produce features users see. It is the kind of work that pays for itself only after the third or fourth time a customer asks "why does my CI render look different from my local one" and the answer is "it doesn't, here is the byte-identical hash." 
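
The "byte-identical hash" in that answer is the same per-frame comparison the nightly audit performs. A minimal sketch in Node.js, assuming each render has already been captured as an array of frame buffers; `hashFrames` and `diffRenders` are hypothetical helper names for illustration, not part of the HyperFrames CLI:

```javascript
const { createHash } = require("node:crypto");

// Hash every frame of a render with SHA-256.
// Each frame is expected to be a Buffer or Uint8Array (for example, PNG bytes).
function hashFrames(frames) {
  return frames.map((frame) => createHash("sha256").update(frame).digest("hex"));
}

// Return the indices of frames whose hashes differ between two renders.
// If one render has fewer frames, every missing frame counts as a mismatch.
function diffRenders(framesA, framesB) {
  const a = hashFrames(framesA);
  const b = hashFrames(framesB);
  const mismatches = [];
  for (let i = 0; i < Math.max(a.length, b.length); i++) {
    if (a[i] !== b[i]) mismatches.push(i);
  }
  return mismatches;
}
```

An empty result means the two renders are identical frame for frame. Returning indices rather than a boolean is a deliberate choice in this sketch: knowing which frame drifted is usually the first clue to which clock, font, or decode race caused it.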
### Why nobody else does this It is reasonable to ask: if determinism is so important, why have decades of video tools shipped without it? The honest answer is that for most of that history, video was authored by humans and consumed by humans, and humans do not notice four-bit variance in the green channel of a single frame. The whole pipeline was tuned for human authorship and human consumption, and a little entropy on the way through was fine. The shift is that videos are increasingly authored by software and consumed by software. CI systems do not have eyes. Agents do not have eyes. Cache layers do not have eyes. Everything downstream of a render is now byte-sensitive in a way it has not been before. The tool that catches up first is the tool that wins. ## What you should demand If you are evaluating a video tool today, here is the test (and if you want a head-to-head, see how we [compare to Remotion](/compare/remotion) on this exact axis). Render the same composition twice. Diff the bytes. If they differ, ask the vendor why, and listen carefully to the answer. The answers fall into three categories. The first category is "we have not done that work yet." This is honest, and you can decide whether to wait. The second category is "determinism is not important for our use case." This is also honest, and you can decide whether your use case is the same as theirs. The third category is "well, technically the output is visually equivalent but there are some sub-perceptual variations in the encoder pass." This is the slot machine talking. Run. ## What we still get wrong I want to close on a note of honesty. Even after a year of work on this, we are not perfectly deterministic in every case. Two failure modes remain. The first is WebGL with floating-point textures. The GPU drivers across vendors produce slightly different results for the same float operations, and we cannot fully normalize this at the engine level. 
We disable GPU rasterization for byte-sensitive renders, but if your composition contains a Three.js shader, the output may differ by a few least-significant bits across machines. We flag this loudly in lint and document it in our determinism guarantees. The second is the timezone-sensitive code that some authors smuggle into compositions. If your script reads the local timezone to format a date string, the output of that string depends on where the renderer is running. We patch `Date` to a fixed UTC offset and warn about `Intl.DateTimeFormat` usage, but a determined author can still introduce nondeterminism. We treat this as a documentation problem rather than an engine problem. In both cases, we are honest about the limits in our docs and in our lint output. Determinism is not a marketing claim we toss out lightly; it is a property we are accountable to. The audit catches drift. The audit is also catching us. Determinism is the line. On one side of it, you have a creative tool. On the other side, you have infrastructure. We crossed the line on purpose, and we are not going back. --- # Sports scoreboard graphics from JSON — broadcast templates in HTML URL: https://hyperframes.video/blog/sports-scoreboard-graphic-generator Published: 2026-05-02T09:00:00.000Z Tags: broadcast, sports, scoreboard, tutorial, data-viz Author: marcus-okafor If you run a league, a podcast, or a sports media account, you produce scoreboards. Every game, every recap, every social cut. The typical workflow — open After Effects, drop in team logos, type the score, render, repeat — collapses at any volume above a single match per day. The fix is the same as for any high-volume graphic: template the bug, drive the variants from JSON, render in batch. Here is the engineering build of a broadcast-grade scoreboard, including the timing rules that make it read as TV instead of as a website. 
## What a "score bug" actually is A broadcast score bug, stripped of network branding, is six pieces of information: 1. Team A: logo, abbreviation, score 2. Team B: logo, abbreviation, score 3. Period / quarter / inning indicator 4. Game clock 5. Optional: possession indicator, count, runners on base 6. Optional: status banner ("HALFTIME," "FINAL," "TIMEOUT") Six fields. The visual variation across networks is mostly typography and which fields are present — the underlying structure is the same. ## The 16:9 lower-third layout A score bug lives in the lower-left or lower-right corner of the frame, occupying about 22% of the width and 8% of the height. Two stacked rows, one per team, each row split into logo / abbreviation / score. <InlineSandbox html={`<!doctype html> <html><body style="margin:0;background:#1a3050;display:grid;place-items:end;min-height:100vh;padding:24px;font-family:ui-sans-serif,system-ui;"> <div style="display:grid;grid-template-columns:auto auto;gap:1px;background:rgba(255,255,255,.1);border-radius:6px;overflow:hidden;color:#fff;box-shadow:0 8px 24px rgba(0,0,0,.4);"> <div style="display:grid;grid-template-rows:auto auto;background:#0a0a0a;"> <div style="display:grid;grid-template-columns:30px 60px 50px;align-items:center;padding:8px 10px;border-bottom:1px solid rgba(255,255,255,.08);background:#0a0a0a;"> <div style="width:22px;height:22px;border-radius:50%;background:#ff3b1f;"></div> <div style="font-weight:800;letter-spacing:.05em;font-size:15px;">NYC</div> <div style="font-weight:800;font-size:18px;text-align:right;font-variant-numeric:tabular-nums;">21</div> </div> <div style="display:grid;grid-template-columns:30px 60px 50px;align-items:center;padding:8px 10px;background:#0a0a0a;"> <div style="width:22px;height:22px;border-radius:50%;background:#1f5fff;"></div> <div style="font-weight:800;letter-spacing:.05em;font-size:15px;">BOS</div> <div 
style="font-weight:800;font-size:18px;text-align:right;font-variant-numeric:tabular-nums;color:#ffb800;">17</div> </div> </div> <div style="background:#ffb800;color:#000;display:grid;grid-template-rows:auto auto;align-items:center;padding:0 14px;font-weight:800;font-variant-numeric:tabular-nums;"> <div style="font-size:11px;letter-spacing:.2em;border-bottom:1px solid rgba(0,0,0,.2);padding:4px 0;text-align:center;">Q3</div> <div style="font-size:18px;padding:4px 0;text-align:center;">04:21</div> </div> </div> </body></html>`} height={300} caption="A two-row score bug with period + clock module. Drop into a 16:9 frame anywhere." /> The accent color on `BOS`'s score is doing important work — it tells the eye "they just scored." Use a brand color for the team that most recently put points on the board, then fade it back to white after 3 seconds. This is the "scoring blink" — every network does it. ## The team variable Two teams, each with a logo (image URL), abbreviation, color, and score. A JSON entry looks like: ```json { "id": "match-2026-05-12-nyc-bos", "period": "Q3", "clock": "04:21", "status": "live", "home": { "abbr": "NYC", "color": "#ff3b1f", "logo": "/teams/nyc.svg", "score": 21 }, "away": { "abbr": "BOS", "color": "#1f5fff", "logo": "/teams/bos.svg", "score": 17 }, "highlightTeam": "away" } ``` The template reads this JSON and renders. To produce a recap reel for the day, loop over every match in the JSON file and render one MP4 per game. ## Score transitions When the score changes during a render, the digit should not snap — it should swap with motion. The two patterns that work: - **Slot-machine roll** (digits scroll vertically). Use for fast-paced sports where the score changes constantly (basketball). - **Crossfade** (old digit fades down, new fades up). Use for slow-paced sports (baseball, golf, soccer). Use a 250ms duration for either. Anything longer and the score change reads as a glitch instead of an update.
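
Before any transition can run, the template has to be filled from the match data. A minimal sketch of that step in plain JavaScript, using the match JSON shown earlier; the function names (`escapeHtml`, `teamRow`, `renderBug`, `renderAll`) and the default accent value are assumptions for illustration, and the class names simply mirror the demo markup in this post:

```javascript
// Escape text so a team abbreviation or clock string cannot break the markup.
function escapeHtml(value) {
  return String(value)
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;");
}

// Build one team row; the highlighted team's score gets the accent color.
function teamRow(team, highlighted, accent) {
  const scoreStyle = highlighted ? ` style="color:${accent}"` : "";
  return (
    `<div class="row">` +
    `<img class="logo" src="${escapeHtml(team.logo)}" alt="">` +
    `<div class="abbr">${escapeHtml(team.abbr)}</div>` +
    `<div class="sc"${scoreStyle}>${escapeHtml(team.score)}</div>` +
    `</div>`
  );
}

// Render the full bug (two team rows plus the period/clock module) for one match.
function renderBug(match, accent = "#ffb800") {
  return (
    `<div class="bug"><div class="tm">` +
    teamRow(match.home, match.highlightTeam === "home", accent) +
    teamRow(match.away, match.highlightTeam === "away", accent) +
    `</div><div class="mod">` +
    `<div class="p">${escapeHtml(match.period)}</div>` +
    `<div class="c">${escapeHtml(match.clock)}</div>` +
    `</div></div>`
  );
}

// One HTML fragment per match, keyed by match id, ready to hand to a renderer.
function renderAll(matches) {
  return new Map(matches.map((m) => [m.id, renderBug(m)]));
}
```

Keeping the fill step a pure function of the match object means the same entry always produces the same markup, which is what makes a batch of thirty clips look like one graphics package.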
<VariableKnobs html={`<style>body{margin:0;background:{{$BG}};color:#fff;display:grid;place-items:center;height:280px;font-family:ui-sans-serif,system-ui;} .bug{display:grid;grid-template-columns:auto auto;gap:1px;background:rgba(255,255,255,.1);border-radius:6px;overflow:hidden;box-shadow:0 8px 24px rgba(0,0,0,.4);} .tm{display:grid;grid-template-rows:auto auto;background:#0a0a0a;} .row{display:grid;grid-template-columns:30px 60px 50px;align-items:center;padding:8px 10px;background:#0a0a0a;border-bottom:1px solid rgba(255,255,255,.06);} .row:last-child{border-bottom:none;} .logo{width:22px;height:22px;border-radius:50%;} .abbr{font-weight:800;letter-spacing:.05em;font-size:15px;} .sc{font-weight:800;font-size:18px;text-align:right;font-variant-numeric:tabular-nums;} .mod{background:{{$CLOCK_BG}};color:#000;display:grid;grid-template-rows:auto auto;align-items:center;padding:0 14px;font-weight:800;font-variant-numeric:tabular-nums;} .mod .p{font-size:11px;letter-spacing:.2em;border-bottom:1px solid rgba(0,0,0,.2);padding:4px 0;text-align:center;} .mod .c{font-size:18px;padding:4px 0;text-align:center;} </style> <div class="bug"> <div class="tm"> <div class="row"><div class="logo" style="background:{{$HOME_COLOR}};"></div><div class="abbr">{{$HOME}}</div><div class="sc">{{$HOME_SCORE}}</div></div> <div class="row"><div class="logo" style="background:{{$AWAY_COLOR}};"></div><div class="abbr">{{$AWAY}}</div><div class="sc">{{$AWAY_SCORE}}</div></div> </div> <div class="mod"><div class="p">{{$PERIOD}}</div><div class="c">{{$CLOCK}}</div></div> </div>`} knobs={[ { name: "HOME", label: "Home abbr", default: "NYC" }, { name: "HOME_COLOR", label: "Home color", type: "color", default: "#ff3b1f" }, { name: "HOME_SCORE", label: "Home score", default: "21" }, { name: "AWAY", label: "Away abbr", default: "BOS" }, { name: "AWAY_COLOR", label: "Away color", type: "color", default: "#1f5fff" }, { name: "AWAY_SCORE", label: "Away score", default: "17" }, { name: "PERIOD", label: 
"Period", default: "Q3" }, { name: "CLOCK", label: "Clock", default: "04:21" }, { name: "CLOCK_BG", label: "Clock bg", type: "color", default: "#ffb800" }, { name: "BG", label: "Frame bg", type: "color", default: "#1a3050" } ]} /> ## Status banners Halftime, full-time, timeout, ejection — these are "interrupts" that take over the bug for 3–5 seconds. Implement them as a third stacked module that slides in from the right, sits, then slides out. The slide is short — 250ms — and uses the same ease curve as the score change. Consistency in motion timing is what makes a graphic package read as a system rather than a collection. ## Recap videos: render in batch The bigger win is the post-game recap. Render thirty 6-second clips, one per scoring play, each with the corresponding bug state. Stitch them in your editor. The bug stays consistent across every clip because every render came from the same template. For full recap automation, see [batch personalized videos](/blog/batch-personalized-videos-from-csv) — the orchestration pattern is identical. ## What you give up by templating Network broadcasts have one thing this approach doesn't: a dedicated graphics operator who tweaks the bug live based on context. You will not have that. The trade is throughput — one template, one engineer, every game covered. For 95% of league media and 100% of social cuts, that trade is correct. [Open the playground](/playground), drop the score bug in, render the recap reel for last week's games. --- # HTML is the next video codec URL: https://hyperframes.video/blog/html-is-the-next-video-codec Published: 2026-04-30T15:00:00.000Z Tags: philosophy, rendering, html, codec Author: hf-team Most people, when they hear the word *codec*, picture a black box that takes pixels in and produces a smaller pile of pixels out. H.264, AV1, ProRes — these are containers that solve one problem: compressing a stream of already-rendered images. 
They are the last mile of a long journey that started somewhere else, usually in After Effects or Premiere, sometimes in a game engine, occasionally in a Python script with `cv2.imwrite` in a loop. We think the journey is upside down. The interesting question is not how to compress pixels. The interesting question is: what is the smallest, most expressive, most diff-able description of *what should be on the screen at time t* — and can we render it deterministically? Once you answer that, the codec moves up the stack. The document becomes the source of truth. The pixels are a build artifact. For us at HyperFrames, the answer to "what is the most expressive description" is increasingly obvious: it is HTML. Or more precisely, HTML plus CSS plus a small amount of seek-friendly JavaScript. We will spend the next two thousand words unpacking why. ## Pixels are the wrong primitive Every modern video file is a sequence of pixel grids with motion vectors and a few clever tricks bolted on. That representation is excellent for the moment you want to *play* a video and useless for everything else. You cannot diff two MP4s. You cannot grep them. You cannot version-control them in a way that means anything. You cannot ask Claude to "make the second chart twice as tall" and get a useful answer. The problem is that the pixel-grid representation throws away every piece of authorial intent on the way down. The fact that this red bar should grow from 0 to 60 over 1.2 seconds with an ease-out cubic? Lost. The fact that this caption is the second sentence in a three-sentence sequence? Lost. The fact that the brand color is `#ff3b1f` and should never drift even one byte? Lost — at least until someone re-extracts it from a frame and discovers the encoder pushed it to `#ff3a1e`. If you have ever tried to make a small text change to an existing 30-second ad, you know how this story ends. You open the original project file. 
The project file references seventeen assets, half of which have moved. You re-render. The new MP4 is structurally identical to the old one except for one word, and you ship 38 megabytes of new pixels to convey three letters of change. ## What HTML already gets right The browser is the most-tested rendering engine in the history of software. Every typography rule, every easing curve, every blend mode, every color space, every shader is debugged across hundreds of millions of devices a day. We do not need to build a new motion graphics renderer. The good one already exists, and it ships in every laptop. HTML and CSS, taken together, are a remarkably good language for describing *how a scene should look at time t*. They have layout (flex, grid, absolute), they have type (variable fonts, OpenType features, italic small caps), they have color (HSL, OKLCH, P3, gradients), they have animation (`@keyframes`, `animation-delay`, `animation-fill-mode`), they have compositing (filters, masks, blend modes), they have 3D transforms, they have SVG and Canvas and WebGL when you need to descend a level. They are, crucially, *declarative*: the document at time t is a function of the document plus t, not of accumulated mutation. The declarative property is the one that matters for video. A deterministic renderer needs to seek. It needs to ask the document, "what should you look like at exactly 1.234 seconds?" and get the same answer every time, regardless of frame rate or thread schedule. CSS animations, when driven by `animation-delay` and `animation-play-state: paused`, give you that. JavaScript that listens for an `hf-seek` event and writes computed state into the DOM gives you that. The browser gives you that. ## What the codec metaphor unlocks Once you accept that the document is the codec, four things happen in quick succession. First, your video is suddenly a text file. Twelve kilobytes of HTML rendered at 1080p produces a hundred megabytes of MP4. 
The HTML is what you store, diff, review in pull requests, ship through your CI, ask an agent to modify. The MP4 is generated on demand. You no longer have a binary build artifact masquerading as a creative asset. Second, your video gains a type system. The composition is structured: title cards have classes, captions have data attributes, charts read from JSON. You can lint it. You can statically analyze it. You can refuse to render if the duration is wrong. We ship `npx hyperframes lint` for exactly this reason. Third, your video becomes composable in the same way websites are. You write a `<LowerThird>` component once and reuse it across thirty videos. You bump a brand token and every composition rebuilds with the new color. You import a chart from a different file. None of this is novel — it is the same logic that made React win. We are just pointing it at frames instead of pages. Fourth, and most consequentially: agents can write video. An LLM that has read a billion HTML pages is fluent in CSS keyframes — which is why our [OpenAI integration](/integrations/openai) lets you generate compositions directly from a function call. It is not fluent in After Effects scene graphs, because there are not a billion of those on the public internet. The shortest path from a prompt to a frame, today, runs straight through HTML. ## What the browser still gets wrong We are not going to pretend the browser was designed for this. There are real problems and they are worth naming. The first is determinism. By default, a browser is a soft-real-time renderer: it tries to draw things at 60 frames per second, dropping frames if it cannot keep up, jittering animations when other tabs steal CPU. That is the opposite of what a frame-perfect render farm needs. The fix is to pause every animation and drive it from a single clock. 
HyperFrames does this with an `hf-seek` event: the engine writes the animation timing through `document.documentElement.style.setProperty`, dispatches the event, waits for `requestAnimationFrame` to settle, then captures the frame. The browser becomes a synchronous time machine.

The second is fonts. Web fonts arrive over the network, and they arrive at unpredictable times. A first-frame render that fires before the font has loaded looks nothing like the second-frame render that fires after. We solve this by waiting on `document.fonts.ready` before the first capture, and by warning loudly when a composition references fonts not in the bundle. If you have ever shipped an ad with the wrong font because the staging environment had a different cache, you know exactly the bug we are preventing.

The third is GPU variance. Two machines running the same Chromium with the same composition can produce subtly different anti-aliasing, particularly for filters and 3D transforms. We pin the Chromium version. We pin the device pixel ratio. We disable the GPU compositor when bitwise determinism matters more than performance. It is not free, but it is honest. (For a side-by-side with the closest peer in this space, see how HyperFrames [compares to Remotion](/compare/remotion).)

### The encoder is still an encoder

To be clear: we still encode to MP4 at the end. The codec metaphor does not mean we have invented a new video format the world has to play. It means the place where authorial intent lives, and the place where pixels live, are now different places. The encoder becomes a boring last step instead of an opinion-laden creative tool. We use H.264 because every device on earth plays it; we use AV1 when bandwidth matters; we use ProRes when an editor downstream wants to color-grade. The interesting layer is upstream.

## What this looks like in practice

A HyperFrames composition is a single HTML file with a small amount of metadata.
You can render it with one command:

```bash
npx hyperframes render hero.html \
  --out hero.mp4 \
  --duration 5 \
  --width 1920 --height 1080 --fps 60
```

The engine boots a headless Chromium, opens the file, waits for fonts and images, then loops over all duration × fps frames (300 in this example), starting at frame 0. For each frame, it dispatches `hf-seek`, lets the browser settle, captures, and pipes to ffmpeg. The output is bit-identical across machines that share the same engine version.

The composition itself looks like a web page that happens to be five seconds long:

```html
<style>
  .title {
    font-family: "Newsreader", serif;
    font-size: 96px;
    animation: rise 700ms cubic-bezier(.2,.7,.1,1) 200ms backwards;
    animation-play-state: paused;
  }
  @keyframes rise {
    from { opacity: 0; transform: translateY(24px); }
    to { opacity: 1; transform: translateY(0); }
  }
</style>
<h1 class="title">Hello, frame 0.</h1>
```

The engine drives `animation-delay` from the seek event. The composition is, structurally, the codec.

## Why this is a generational shift

For most of the last forty years, computer graphics has lived in two worlds. There is the real-time world (games, 3D apps) where you write shaders and accept whatever the GPU draws. And there is the offline world (film, ads, motion graphics) where you write keyframes in a proprietary tool and wait for a render farm. Video on the web has lived uncomfortably between them, mostly by way of the offline world: someone makes an MP4 in a desktop tool, then someone else uploads it. The "web video" pipeline has been Premiere with extra steps.

We think there is a third world emerging. It is one where compositions are documents (versioned, diffable, agent-writable, deterministic on render) and where the act of "making a video" is much closer to the act of building a static site. The codec is HTML. The renderer is the browser. The output is a sequence of pixels you ship to wherever pixels are useful.

The next decade of video will be written, not exported.
The fact that this sentence sounds obvious only after you write it is the first sign that something is about to move. If you want to start now, `npx hyperframes init` puts a working composition on your disk in under a minute, or you can poke at compositions directly in the [browser playground](/playground). The future of video is text. Open your editor. --- # How to burn subtitles into an MP4 (and why you should) URL: https://hyperframes.video/blog/burn-subtitles-into-mp4 Published: 2026-04-30T09:00:00.000Z Tags: subtitles, captions, video, ffmpeg, tutorial Author: kira-tanaka Eighty-five percent of social video plays muted. Your video either has subtitles or it has nothing. The default move is FFmpeg's `subtitles=` filter, which reads an SRT and burns it into the frame. It works. It also looks like 2010, ignores brand typography, and gives you exactly one (1) lever — font size. If your video has any visual identity, you want a different path. This is the design-controlled burn-in: render subtitles as HTML inside the video template, frame-aligned to the same timeline as everything else. You get the typography you actually want, the line breaks at the right places, and full control over background blocking, kerning, and animation. ## What burned-in subtitles actually need Five things, in order of impact on legibility: 1. **High-contrast backing.** Solid black at 65–80% opacity behind the text. Don't try to read 18pt sans-serif against arbitrary video without a backing — you'll lose 30% of viewers in low-contrast scenes. 2. **Bottom-third positioning, with safe-area margin.** The bottom 15% of the frame, with a 4% margin from the bottom edge. 3. **Two lines max, ~32 characters per line.** Anything longer scrolls or wraps inelegantly. 4. **Snap to word boundaries on timing.** Subtitles that change mid-sentence read as broken. 5. **Sans-serif, ~22pt at 1080p, weight 600.** Bold enough to read; not so bold it looks like a meme template. 
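Rule 3 is easy to check mechanically before a render. The sketch below is a minimal greedy word wrap, assuming plain-text captions and a hard 32-character limit; `wrapCaption` and `fitsCaptionRules` are illustrative names, not part of any HyperFrames API.

```typescript
// Greedy word wrap: fills each line up to maxChars, breaking only between words.
// A single word longer than maxChars is kept whole on its own line.
function wrapCaption(text: string, maxChars = 32): string[] {
  const lines: string[] = [];
  let current = "";
  for (const word of text.trim().split(/\s+/)) {
    if (word === "") continue; // empty input yields one empty token
    const candidate = current === "" ? word : `${current} ${word}`;
    if (candidate.length <= maxChars || current === "") {
      current = candidate;
    } else {
      lines.push(current);
      current = word;
    }
  }
  if (current !== "") lines.push(current);
  return lines;
}

// True when the caption fits the "two lines max, ~32 characters per line" rule.
function fitsCaptionRules(text: string, maxChars = 32, maxLines = 2): boolean {
  const lines = wrapCaption(text, maxChars);
  return lines.length <= maxLines && lines.every(l => l.length <= maxChars);
}
```

Running `fitsCaptionRules` over every chunk before rendering catches captions that would wrap to a third line, which is cheaper than discovering them in the finished MP4.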
Get those five right and your captions read on any platform without further tuning. <InlineSandbox html={`<!doctype html> <html><body style="margin:0;background:linear-gradient(135deg,#3a2a1a,#5a3a1a);height:100vh;font-family:ui-sans-serif,system-ui;color:#fff;position:relative;display:grid;place-items:center;"> <div style="font-weight:800;font-size:72px;opacity:.15;letter-spacing:-.04em;">B-ROLL</div> <div style="position:absolute;left:0;right:0;bottom:80px;display:grid;place-items:center;padding:0 24px;"> <div style="background:rgba(0,0,0,.75);padding:8px 16px;border-radius:4px;font-weight:600;font-size:18px;line-height:1.3;max-width:520px;text-align:center;backdrop-filter:blur(2px);"> The codec doesn't care about your brand,<br/>but the viewer does. </div> </div> </body></html>`} height={320} caption="Burned-in subtitle with dark backing — readable on any background." /> ## Timing: from transcript to frames The transcript-to-subtitle pipeline: 1. **Transcribe with Whisper** (or your provider of choice). You get word-level timestamps. 2. **Chunk into 2–6 word phrases.** A line break every 30 characters or every 1.5 seconds, whichever comes first. 3. **Snap to word boundaries.** Never break mid-word. 4. **Emit `[ { text, startMs, endMs } ]`**. Inside the HTML template, render the active subtitle by finding the chunk whose `[startMs, endMs]` contains the current frame's time. ```html <div class="captions" data-frame-time="0">  <div class="cap">The codec doesn't care about your brand</div> </div> ``` ## Animation: in and out, not during The two acceptable animations on a subtitle: - **In**: a 120ms fade + 4px slide-up. - **Out**: a 120ms fade (no slide). Anything more elaborate — slot-machine letter reveals, word-by-word color changes — distracts from the speech they exist to support. Keep them quiet. 
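Both the chunk lookup from the timing section and the two animations above can be written as pure functions of the frame time. This is a minimal sketch, assuming the chunks are non-overlapping, the fades are linear, and each range is treated as half-open so adjacent chunks never both match on a shared boundary; the names are illustrative, not HyperFrames API.

```typescript
// Chunk shape matching the `{ text, startMs, endMs }` array described above.
interface Chunk {
  text: string;
  startMs: number;
  endMs: number;
}

const FADE_MS = 120; // fade duration for both in and out
const SLIDE_PX = 4;  // slide-up distance, applied on the way in only

// Returns the chunk active at timeMs, using the half-open range [startMs, endMs).
function activeChunk(chunks: Chunk[], timeMs: number): Chunk | undefined {
  return chunks.find(c => timeMs >= c.startMs && timeMs < c.endMs);
}

// Opacity and vertical offset for a chunk at a given frame time.
function captionStyle(chunk: Chunk, timeMs: number): { opacity: number; translateY: number } {
  const clamp01 = (x: number) => Math.min(1, Math.max(0, x));
  const fadeIn = clamp01((timeMs - chunk.startMs) / FADE_MS);
  const fadeOut = clamp01((chunk.endMs - timeMs) / FADE_MS);
  return {
    opacity: Math.min(fadeIn, fadeOut),  // whichever fade is further from done wins
    translateY: (1 - fadeIn) * SLIDE_PX, // slides from 4px to 0 during the fade-in
  };
}
```

Because nothing here reads the clock, seeking to the same frame twice gives the same caption state, which is what the render loop relies on.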
<VariableKnobs html={`<style>body{margin:0;background:{{$BG}};color:#fff;height:320px;font-family:ui-sans-serif,system-ui;position:relative;display:grid;place-items:center;} .bg{font-weight:800;font-size:80px;opacity:.12;letter-spacing:-.04em;} .cap{position:absolute;left:0;right:0;bottom:40px;display:grid;place-items:center;} .cap .b{background:rgba(0,0,0,{{$OPACITY}});padding:{{$PAD}}px {{$PADX}}px;border-radius:{{$RADIUS}}px;font-weight:{{$WEIGHT}};font-size:{{$SIZE}}px;line-height:1.3;color:{{$FG}};max-width:520px;text-align:center;} </style> <div class="bg">B-ROLL</div> <div class="cap"><div class="b">{{$TEXT}}</div></div>`} knobs={[ { name: "TEXT", label: "Caption text", default: "Most social video plays muted." }, { name: "SIZE", label: "Font size", type: "number", default: "18", min: 12, max: 32, step: 1 }, { name: "WEIGHT", label: "Font weight", type: "number", default: "600", min: 400, max: 900, step: 100 }, { name: "PAD", label: "Vertical pad", type: "number", default: "8", min: 4, max: 20, step: 1 }, { name: "PADX", label: "Horizontal pad", type: "number", default: "16", min: 8, max: 32, step: 1 }, { name: "RADIUS", label: "Corner radius", type: "number", default: "4", min: 0, max: 24, step: 1 }, { name: "OPACITY", label: "BG opacity", default: ".75" }, { name: "FG", label: "Text color", type: "color", default: "#ffffff" }, { name: "BG", label: "Video bg", type: "color", default: "#3a2a1a" } ]} /> ## Why not FFmpeg subtitles? FFmpeg's burn-in works, and for SRT files with no design opinion, it is fine. Three reasons to skip it for produced content: 1. **Typography control is shallow.** You get font + size. You don't get kerning, line height, backing radius. 2. **No per-platform variants.** You will want shorter lines for 9:16 than for 16:9. FFmpeg can't do that without re-encoding. 3. **No animation.** A static subtitle that flicks on and off looks worse than one that fades. For a one-shot recording with an SRT, use FFmpeg. 
For anything you'll iterate on, render captions inside the template. ## Multi-language subtitles The same template can render multiple language variants by swapping the chunk array. The geometry stays the same; only the text changes. Ten Spanish renders + ten English renders from the same source. This is where templating beats SRT. SRT files don't ship with their own typography, so a Cyrillic SRT renders in Arial on FFmpeg's default. A template ships with the language-specific font (Noto Sans CJK for Japanese, etc.) baked into the CSS. ## Speaker-attributed captions For interview content, prefix each chunk with the speaker: ``` KIRA: We tried four codecs before AV1 came up. INT: Why not just default to AV1 from the start? ``` Render with a colored label per speaker. Two-speaker dialogue is the right complexity ceiling; three or more speakers degrades quickly without a video-conference-style spatial cue. ## The render pipeline Inside the [HyperFrames pipeline](/tools/html-to-video), captions are just another HTML element with a frame-time variable. The render loop seeks the frame-time, the active caption updates, the frame is captured. Deterministic, frame-aligned, no race conditions. The same template that renders [TikTok variants](/blog/tiktok-video-from-template) renders the burned-in subtitle pass — no separate tool, no SRT roundtrip. [Open the playground](/playground), paste a chunk array, see the caption track align to the seek bar. --- # Render a React component to MP4 (the practical way) URL: https://hyperframes.video/blog/react-to-mp4-tutorial Published: 2026-04-28T09:00:00.000Z Tags: react, mp4, rendering, tutorial, engineering Author: kira-tanaka The question comes up about once a month: "I have a React component that animates. How do I turn it into an MP4?" The answers on Stack Overflow point to Puppeteer screen-recording, which works for a demo and falls apart in production. 
There's a better path that respects what React is good at and treats video as a deterministic frame sequence instead of a screen recording. This is the engineering walk-through: deterministic time, props as variables, the difference between "captured" and "rendered" video, and where each approach stops working. ## The fundamental problem with screen recording The naive approach: spin up a headless browser, navigate to a React app, start a video recorder, wait for the animation to finish, stop the recorder. This works for demos and breaks in production for three reasons: 1. **Real-time playback ties recording to wall-clock**. A 10-second animation takes 10 seconds to record. A 1000-variant batch takes nearly 3 hours. 2. **Frame timing is non-deterministic**. `requestAnimationFrame` runs when the browser feels like it. Two recordings of the same animation will not be byte-identical. 3. **Dropped frames at jitter spikes**. Any GC pause or system load shows up as a stuttered moment in the output. The fix: don't record, **render frame-by-frame**. Drive the React component with an explicit time variable, snapshot each frame, encode the sequence into MP4. Wall-clock time disappears; the only thing that matters is "given t=0.42s, what does the component look like." 
## The deterministic-time prop

Replace `useEffect`-driven animation with a `time` prop:

```tsx
import { useEffect, useState } from "react";

// Assumed cubic ease-out; any pure easing function works here.
const easeOut = (t: number) => 1 - Math.pow(1 - t, 3);

// Don't: the animation is driven by wall-clock time via requestAnimationFrame.
function BadCounter({ target }: { target: number }) {
  const [n, setN] = useState(0);
  useEffect(() => {
    const start = performance.now();
    const tick = (now: number) => {
      const t = Math.min(1, (now - start) / 1000);
      setN(Math.round(target * easeOut(t)));
      if (t < 1) requestAnimationFrame(tick);
    };
    requestAnimationFrame(tick);
  }, [target]);
  return <span>{n}</span>;
}

// Do: the output is a pure function of the props, including `time`.
function GoodCounter({ target, time, duration = 1 }: { target: number; time: number; duration?: number }) {
  const t = Math.min(1, time / duration);
  const n = Math.round(target * easeOut(t));
  return <span className="tabular-nums">{n}</span>;
}
```

The `time` prop is the render time in seconds. The render loop sets it explicitly for each frame: 0.000, 0.033, 0.067, 0.100, ... (at 30fps). The component is pure given a `time` value.

## The render loop

Conceptually:

```ts
// Placeholders for the pipeline's capture and encode steps.
declare function renderFrame(t: number): Promise<void>;
declare function encodeFrames(): Promise<void>;

const FPS = 30;
const DURATION_S = 8;
const TOTAL_FRAMES = FPS * DURATION_S;

for (let i = 0; i < TOTAL_FRAMES; i++) {
  const t = i / FPS; // derived from the frame index, never from the clock
  await renderFrame(t);
}
await encodeFrames();
```

`renderFrame` sets the `time` prop to `t` and snapshots the rasterized React output to a PNG. `encodeFrames` muxes the PNG sequence into an MP4. In the [HyperFrames pipeline](/tools/html-to-video), both are handled: you pass a component and a duration, you get an MP4.

## Props as variables

Once a component has `time` plus its data props, the same component renders any number of variants. Pass `target=1000` for one render, `target=42` for the next; same component code, two different outputs.

This is the bridge to [batch rendering from CSV](/blog/batch-personalized-videos-from-csv): each CSV row is a prop object, each render is one MP4, the React code never changes.
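To make the "each CSV row is a prop object" idea concrete, here is a minimal sketch that turns rows into render jobs. It assumes a two-column `label,target` CSV with a header row and no quoted fields; `rowsToProps`, `frameTimes`, `planJobs`, and the `variant-N.mp4` naming are hypothetical helpers for illustration, not the HyperFrames batch API.

```typescript
// Hypothetical prop shape for the counter component described above.
interface CounterProps {
  target: number;
  label: string;
}

interface RenderJob {
  props: CounterProps;
  output: string;
  times: number[]; // seconds, one entry per frame
}

// Parses a simple "label,target" CSV (header row, no quoted fields) into prop objects.
function rowsToProps(csv: string): CounterProps[] {
  const [, ...rows] = csv.trim().split(/\r?\n/); // drop the header row
  return rows
    .filter(row => row.trim() !== "")
    .map(row => {
      const [label, target] = row.split(",");
      return { label: label.trim(), target: Number(target) };
    });
}

// Frame timestamps derived from the frame index, so there is no accumulated drift.
function frameTimes(fps: number, durationS: number): number[] {
  const total = Math.round(fps * durationS);
  return Array.from({ length: total }, (_, i) => i / fps);
}

// One job per row: same component, same timeline, different props.
function planJobs(csv: string, fps: number, durationS: number): RenderJob[] {
  const times = frameTimes(fps, durationS);
  return rowsToProps(csv).map((props, i) => ({
    props,
    output: `variant-${i}.mp4`,
    times,
  }));
}
```

Every job shares the same `times` array on purpose: the timeline is a property of the template, and only the props vary between renders.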
<CodeTabs tabs={[ { label: "Component", lang: "tsx", code: `function CounterFrame({ time, target, label }: {
  time: number;
  target: number;
  label: string;
}) {
  const t = Math.min(1, time / 1.5);
  const eased = 1 - Math.pow(1 - t, 3);
  const n = Math.round(target * eased);
  return (
    <div className="frame">
      <div className="num tabular-nums">
        {n.toLocaleString()}
      </div>
      <div className="label">{label}</div>
    </div>
  );
}`, }, { label: "Driver", lang: "ts", code: `import { render } from "@hyperframes/render";

await render({
  component: CounterFrame,
  props: { target: 184_920, label: "Q2 Revenue" },
  duration: 4, // seconds
  fps: 30,
  size: { w: 1920, h: 1080 },
  output: "./counter.mp4",
});`, }, { label: "Result", html: `<!doctype html><html><head><style>
body{margin:0;background:#0a0a0a;color:#fff;display:grid;place-items:center;height:100vh;font-family:ui-sans-serif,system-ui;}
.f{display:grid;place-items:center;gap:10px;}
.n{font-size:120px;font-weight:800;letter-spacing:-.04em;font-variant-numeric:tabular-nums;color:#ff3b1f;}
.l{font:600 13px ui-monospace,monospace;letter-spacing:.25em;text-transform:uppercase;color:rgba(255,255,255,.55);}
</style></head><body>
<div class="f"><div class="n">184,920</div><div class="l">Q2 REVENUE</div></div>
</body></html>`, }, ]} caption="Pure React component → render driver → final frame." height={360} />

## Hooks that work, hooks that don't

The reframe: any hook that depends on `time` or `props` is fine. Any hook that depends on wall-clock time is not.

| Hook | Render-safe? | Notes |
|---|---|---|
| `useState` | Yes | Initialize from props |
| `useMemo` | Yes | Pure derivation |
| `useReducer` | Yes | Deterministic state machine |
| `useEffect` with empty deps | Sometimes | OK for one-time setup, but no `setInterval` |
| `useEffect` reading `Date.now()` | No | Replace with `time` prop |
| `requestAnimationFrame` | No | Render loop drives time, not rAF |
| `useTransition` | No | Concurrent rendering is non-deterministic |

The general rule: if a component's output depends on anything other than its props, refactor.

## Server-side rendering vs. headless browser

Two approaches to the snapshot step:

- **Server-side render with `ReactDOMServer.renderToString`** → HTML string → headless browser rasterizes. Faster per-frame, but you need the browser anyway for layout.
- **Render in a headless browser directly** with the component re-rendered per frame. Slower per-frame, but you skip the SSR roundtrip and animations like `transform` work natively.

HyperFrames uses the second approach with a long-lived browser process (start once, render N frames, exit). The browser stays warm; only the page navigation and frame capture happen per render.

## When this approach stops working

Three cases where you reach for something else:

1. **Real-time streaming with React.** Use a WebRTC pipeline, not a render pipeline. Different problem entirely.
2. **Components that depend on a backend API.** Pre-fetch the data, pass as props. Don't let the render loop wait on HTTP.
3. **Components that use Canvas or WebGL.** These work, but you lose the SVG/DOM scrubbing model. Capture works fine; deterministic re-render across worker boundaries gets trickier.

For everything else (UI animations, dashboards, data-driven graphics, social posts), pure-props React renders cleanly to MP4. The mental model is "video is a function of time and props." Hold that line and the pipeline becomes mechanical.

[Open the playground](/playground), paste a counter, scrub the timeline.
--- # Animated route map videos — travel reels from a GPX file URL: https://hyperframes.video/blog/animated-route-map-video Published: 2026-04-26T09:00:00.000Z Tags: maps, travel, route, svg, tutorial Author: ren-park If you run a travel account, a delivery service, a cycling team, or anything that involves moving across a map — you have made a route map video. You have also probably made it in Mapbox Studio, exported a screen recording, and watched the frame rate stutter on a corner of the map you cared about most. There is a cleaner build: SVG path tracing, a marker that rides the path, a distance counter that ticks up alongside, and a basemap that stays still. All in one HTML file. From a GPX file or a JSON of coordinates, render any number of routes to MP4. ## What an "animated route" actually is Three pieces: 1. **A basemap** — either a static tile screenshot or a simplified SVG world. Doesn't move. 2. **A path** — the route polyline, rendered as an SVG `<path>` with `stroke-dasharray`-based tracing. 3. **A marker** — a circle or icon riding the path's endpoint at any given time. The illusion of "the map is being explored" comes from the path tracing, not from camera motion. Resist the urge to pan the basemap; it makes the video harder to follow. ## The path-trace primitive Same primitive as an [animated line chart](/blog/animated-line-chart-html): set `stroke-dasharray` equal to the path length, set `stroke-dashoffset` from that length to zero. Animate `stroke-dashoffset` and the path draws. For a route, the trick is converting GPS coordinates to SVG coordinates. Two steps: 1. **Project** lat/lng to a flat plane. Web Mercator is fine for most distances. 2. **Scale** the projected points to fit the SVG viewBox. Once the path is an `<svg><path d="M...">`, it animates the same way any other SVG path does. 
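The two steps above can be sketched in a few lines. This assumes the standard spherical Web Mercator formula normalized to a unit square, and a uniform scale that preserves the route's aspect ratio and centers it in the viewBox; `webMercator`, `fitToViewBox`, and `toPathD` are illustrative names, not library functions.

```typescript
interface LatLon { lat: number; lon: number; }
interface Point { x: number; y: number; }

// Web Mercator on a unit square: x and y in [0, 1], with y growing downward like SVG.
function webMercator({ lat, lon }: LatLon): Point {
  const latRad = (lat * Math.PI) / 180;
  const x = (lon + 180) / 360;
  const y = (1 - Math.log(Math.tan(latRad) + 1 / Math.cos(latRad)) / Math.PI) / 2;
  return { x, y };
}

// Scales projected points uniformly to fit a w × h viewBox with padding, centered.
function fitToViewBox(points: Point[], w: number, h: number, padding: number): Point[] {
  if (points.length === 0) return [];
  let minX = Infinity, maxX = -Infinity, minY = Infinity, maxY = -Infinity;
  for (const p of points) {
    minX = Math.min(minX, p.x); maxX = Math.max(maxX, p.x);
    minY = Math.min(minY, p.y); maxY = Math.max(maxY, p.y);
  }
  const spanX = maxX - minX;
  const spanY = maxY - minY;
  const scale = Math.min(
    spanX > 0 ? (w - 2 * padding) / spanX : Infinity,
    spanY > 0 ? (h - 2 * padding) / spanY : Infinity,
  );
  const s = Number.isFinite(scale) ? scale : 1; // single-point routes have no span
  const offX = (w - spanX * s) / 2;
  const offY = (h - spanY * s) / 2;
  return points.map(p => ({ x: offX + (p.x - minX) * s, y: offY + (p.y - minY) * s }));
}

// Builds the `d` attribute for an SVG <path> from fitted points.
function toPathD(points: Point[]): string {
  return points
    .map((p, i) => `${i === 0 ? "M" : "L"} ${p.x.toFixed(1)} ${p.y.toFixed(1)}`)
    .join(" ");
}
```

Using one scale factor for both axes matters: scaling x and y independently would stretch the route to fill the frame and distort its shape.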
<InlineSandbox html={`<!doctype html> <html><body style="margin:0;background:#0a0a0a;display:grid;place-items:center;height:100vh;font-family:ui-sans-serif,system-ui;color:#fff;"> <div style="position:relative;width:520px;height:340px;border-radius:14px;overflow:hidden;background:radial-gradient(at 30% 40%,#1a3050,#0a1020 60%,#080810);"> <svg viewBox="0 0 520 340" style="position:absolute;inset:0;width:100%;height:100%;"> <g stroke="rgba(255,255,255,.08)" stroke-width="1" fill="none"> <path d="M0 80 L520 80 M0 160 L520 160 M0 240 L520 240 M120 0 L120 340 M260 0 L260 340 M400 0 L400 340"/> </g> <path d="M60 280 Q 120 180 180 200 T 320 120 T 460 60" fill="none" stroke="rgba(255,255,255,.15)" stroke-width="6" stroke-linecap="round"/> <path d="M60 280 Q 120 180 180 200 T 320 120 T 460 60" fill="none" stroke="#ff3b1f" stroke-width="3" stroke-linecap="round" pathLength="1" stroke-dasharray="0.65 1" stroke-dashoffset="0"> <animate attributeName="stroke-dasharray" from="0 1" to="1 0" dur="6s" fill="freeze" repeatCount="indefinite"/> </path> <circle r="6" fill="#fff" stroke="#ff3b1f" stroke-width="3"> <animateMotion dur="6s" repeatCount="indefinite" fill="freeze" path="M60 280 Q 120 180 180 200 T 320 120 T 460 60"/> </circle> <circle cx="60" cy="280" r="5" fill="#fff" stroke="#1f5fff" stroke-width="2"/> <circle cx="460" cy="60" r="5" fill="none" stroke="#fff" stroke-width="2"/> </svg> <div style="position:absolute;top:14px;left:16px;font:600 10px ui-monospace,monospace;letter-spacing:.25em;text-transform:uppercase;color:rgba(255,255,255,.55);">Route · Day 03</div> <div style="position:absolute;bottom:14px;right:16px;text-align:right;"> <div style="font:700 28px ui-sans-serif,system-ui;font-variant-numeric:tabular-nums;letter-spacing:-.02em;">62.4 km</div> <div style="font:600 10px ui-monospace,monospace;letter-spacing:.25em;text-transform:uppercase;color:rgba(255,255,255,.55);margin-top:2px;">elapsed</div> </div> </div> </body></html>`} height={400} caption="Route trace + 
moving marker + distance counter. Pure SVG, no map library." />

The `<animateMotion>` element rides the marker along the path, synchronized with the trace since both use the same duration. SVG has handled the hard part for you since its 1.0 specification in 2001; we just forgot.

## The basemap question

Three options, in increasing complexity:

1. **Solid color or radial gradient.** Looks "designed," not "geographic." Use for stylized travel content.
2. **Static Mapbox screenshot** as a PNG layer. Real, recognizable, costs an API call.
3. **Simplified country/region SVG** rendered behind the route. Recognizable as geography without the tile-server cost.

For a video that lives on social, option 1 is usually the right call: the route shape is what people see, the basemap is just texture. For a brand that needs the route placed in geographic context (a delivery service showing "we cover the bay area"), option 2 or 3.

## Distance / elevation counter

The number in the corner does heavy lifting. It tells the eye how far along the route the marker is, ticks up alongside the trace, and gives the video a stable focal point.

Compute it the same way as the trace: cumulative distance along the path, scaled by the current `t` value. At `t=0.5`, show half the total distance.

For multi-day routes, add an elapsed-time counter too: "Day 3, 14:22 elapsed." Two numbers max; more than that and the lower-third gets noisy.
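A minimal sketch of that counter, assuming the route is still available as lat/lon points and that great-circle (haversine) distance on a spherical Earth is accurate enough for a label; the function names and the one-decimal formatting are illustrative choices.

```typescript
interface LatLon { lat: number; lon: number; }

const EARTH_RADIUS_KM = 6371; // mean Earth radius, spherical approximation

// Great-circle distance between two points using the haversine formula.
function haversineKm(a: LatLon, b: LatLon): number {
  const toRad = (deg: number) => (deg * Math.PI) / 180;
  const dLat = toRad(b.lat - a.lat);
  const dLon = toRad(b.lon - a.lon);
  const h =
    Math.sin(dLat / 2) ** 2 +
    Math.cos(toRad(a.lat)) * Math.cos(toRad(b.lat)) * Math.sin(dLon / 2) ** 2;
  return 2 * EARTH_RADIUS_KM * Math.asin(Math.sqrt(h));
}

// Total route length: the sum of the distances between consecutive points.
function totalDistanceKm(points: LatLon[]): number {
  let total = 0;
  for (let i = 1; i < points.length; i++) {
    total += haversineKm(points[i - 1], points[i]);
  }
  return total;
}

// Label for the corner counter at progress t in [0, 1], e.g. half the total at t = 0.5.
function distanceLabel(totalKm: number, t: number): string {
  const clamped = Math.min(1, Math.max(0, t));
  return `${(totalKm * clamped).toFixed(1)} km`;
}
```

Compute `totalDistanceKm` once from the unsimplified track, then call `distanceLabel` per frame with the same `t` that drives the trace, so the number and the line stay in step.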
<VariableKnobs html={`<style>body{margin:0;background:#0a0a0a;color:#fff;height:340px;display:grid;place-items:center;font-family:ui-sans-serif,system-ui;} .m{position:relative;width:520px;height:300px;border-radius:14px;overflow:hidden;background:radial-gradient(at 30% 40%,{{$BG1}},{{$BG2}} 60%,#080810);} .lbl{position:absolute;top:14px;left:16px;font:600 10px ui-monospace,monospace;letter-spacing:.25em;text-transform:uppercase;color:rgba(255,255,255,.55);} .stat{position:absolute;bottom:14px;right:16px;text-align:right;} .stat .n{font:700 28px ui-sans-serif,system-ui;font-variant-numeric:tabular-nums;letter-spacing:-.02em;color:{{$ACCENT}};} .stat .l{font:600 10px ui-monospace,monospace;letter-spacing:.25em;text-transform:uppercase;color:rgba(255,255,255,.55);margin-top:2px;} </style> <div class="m"> <svg viewBox="0 0 520 300" style="position:absolute;inset:0;width:100%;height:100%;"> <path d="M60 240 Q 120 140 180 160 T 320 80 T 460 40" fill="none" stroke="rgba(255,255,255,.15)" stroke-width="6" stroke-linecap="round"/> <path d="M60 240 Q 120 140 180 160 T 320 80 T 460 40" fill="none" stroke="{{$ACCENT}}" stroke-width="3" stroke-linecap="round" pathLength="1" stroke-dasharray="0.65 1" stroke-dashoffset="0"><animate attributeName="stroke-dasharray" from="0 1" to="1 0" dur="6s" fill="freeze" repeatCount="indefinite"/></path> <circle r="6" fill="#fff" stroke="{{$ACCENT}}" stroke-width="3"><animateMotion dur="6s" repeatCount="indefinite" fill="freeze" path="M60 240 Q 120 140 180 160 T 320 80 T 460 40"/></circle> </svg> <div class="lbl">{{$LABEL}}</div> <div class="stat"><div class="n">{{$VALUE}}</div><div class="l">{{$UNIT}}</div></div> </div>`} knobs={[ { name: "LABEL", label: "Route label", default: "PYRENEES · DAY 3" }, { name: "VALUE", label: "Headline value", default: "62.4 km" }, { name: "UNIT", label: "Unit label", default: "elapsed" }, { name: "ACCENT", label: "Route color", type: "color", default: "#ff3b1f" }, { name: "BG1", label: "Basemap fg", type: 
"color", default: "#1a3050" }, { name: "BG2", label: "Basemap bg", type: "color", default: "#0a1020" } ]} /> ## From GPX to SVG path A GPX file is XML with a list of `<trkpt>` elements, each with `lat` and `lon`. Parse, project, simplify, render: ```ts const points = parseGpx(gpxXml); // [{ lat, lon }, ...] const projected = points.map(webMercator); const simplified = douglasPeucker(projected, 0.001); const fitted = fitTo({ w: 520, h: 340, padding: 40 })(simplified); const d = `M ${fitted[0].x} ${fitted[0].y} ` + fitted.slice(1).map(p => `L ${p.x} ${p.y}`).join(" "); ``` The `douglasPeucker` simplification matters — a raw GPX track has thousands of points, most of them redundant. Simplification gets you a path that renders cleanly without losing the route's character. ## Use cases - **Travel reels** — daily routes from a tour, weekly summaries. - **Delivery coverage maps** — "we drove 4,200 km this week." - **Cycling / running clubs** — Strava-style highlight reels. - **Sales / business** — "our team visited these 14 cities in Q2." For each, the template is the same; only the path data and the labels change. ## Batch from a coordinate file The same template + a coordinates JSON renders N route videos. For a travel brand publishing a video per trip, the variant cost goes from "an afternoon in After Effects" to "the time it takes the encoder to write the file." [Open the playground](/playground), drop in a coordinate array, watch the marker ride the trace. --- # Tailwind v4 animation cheatsheet — every motion utility, with examples URL: https://hyperframes.video/blog/tailwind-animation-cheatsheet Published: 2026-04-24T09:00:00.000Z Tags: tailwind, css, animation, cheatsheet, tutorial Author: marcus-okafor Tailwind v4 changed how animations are configured — no more `tailwind.config.js`, just CSS tokens. If you came from v3 and your `animate-*` utilities aren't behaving the way you remember, you are not alone. 
Here is the full motion vocabulary for v4, with live examples and the patterns that hold up in production. ## The built-in animations Tailwind ships five utilities out of the box. They are intentionally limited — the assumption is you'll define your own for anything brand-specific. <InlineSandbox html={`<!doctype html><html><head><style> body{margin:0;background:#0a0a0a;color:#fff;display:grid;place-items:center;height:100vh;font-family:ui-sans-serif,system-ui;padding:24px;box-sizing:border-box;} .grid{display:grid;grid-template-columns:repeat(5,1fr);gap:24px;} .box{display:grid;place-items:center;gap:10px;} .swatch{width:60px;height:60px;border-radius:12px;background:#ff3b1f;} .label{font:600 11px ui-monospace,monospace;letter-spacing:.15em;text-transform:uppercase;color:rgba(255,255,255,.55);} .a-spin{animation:spin 1.5s linear infinite;} .a-ping{animation:ping 1.5s cubic-bezier(0,0,0.2,1) infinite;} .a-pulse{animation:pulse 2s cubic-bezier(0.4,0,0.6,1) infinite;} .a-bounce{animation:bounce 1s infinite;} .a-spin-slow{animation:spin 3s linear infinite;} @keyframes spin{to{transform:rotate(360deg);}} @keyframes ping{75%,100%{transform:scale(2);opacity:0;}} @keyframes pulse{50%{opacity:.5;}} @keyframes bounce{0%,100%{transform:translateY(-25%);animation-timing-function:cubic-bezier(0.8,0,1,1);}50%{transform:none;animation-timing-function:cubic-bezier(0,0,0.2,1);}} </style></head><body> <div class="grid"> <div class="box"><div class="swatch a-spin"></div><div class="label">spin</div></div> <div class="box"><div class="swatch a-ping"></div><div class="label">ping</div></div> <div class="box"><div class="swatch a-pulse"></div><div class="label">pulse</div></div> <div class="box"><div class="swatch a-bounce"></div><div class="label">bounce</div></div> <div class="box"><div class="swatch a-spin-slow"></div><div class="label">custom</div></div> </div> </body></html>`} height={220} caption="The four built-in animations + one custom ('spin-slow')." 
/>

| Utility | Effect | Use for |
|---|---|---|
| `animate-spin` | 360° rotation, 1s linear, infinite | Loading spinners only |
| `animate-ping` | Scale + fade, 1s, infinite | Notification dots, ripple states |
| `animate-pulse` | Opacity fade in/out, 2s | Skeleton loaders |
| `animate-bounce` | Vertical bounce, 1s | Scroll indicators |
| `animate-none` | Disables animation | Conditional / motion-reduced contexts |

That's it for built-ins. Everything else is custom.

## Defining custom animations in v4

In v4 you define keyframes and animation utilities directly in CSS using `@theme`:

```css
@import "tailwindcss";

@theme {
  --animate-shake: shake 0.4s cubic-bezier(.36,.07,.19,.97) both;
  --animate-fade-in: fade-in 0.3s ease-out;
  --animate-slide-up: slide-up 0.4s cubic-bezier(.34,1.56,.64,1) both;

  @keyframes shake {
    10%, 90% { transform: translate3d(-1px, 0, 0); }
    20%, 80% { transform: translate3d(2px, 0, 0); }
    30%, 50%, 70% { transform: translate3d(-4px, 0, 0); }
    40%, 60% { transform: translate3d(4px, 0, 0); }
  }
  @keyframes fade-in {
    from { opacity: 0; }
  }
  @keyframes slide-up {
    from { opacity: 0; transform: translateY(8px); }
  }
}
```

Now `<div class="animate-shake">` works. The `--animate-*` token both defines the animation and registers the utility class — one declaration, no separate plugin step.

## The motion-reduced variant

`motion-safe:` and `motion-reduce:` honor the user's OS preference. Use them on every non-essential animation:

```html
<div class="motion-safe:animate-fade-in motion-reduce:animate-none">
  Hello
</div>
```

The rule we follow: any animation that is decorative — fades, slides, attention-grabbers — gets `motion-safe:`. Any animation that conveys state — a loading spinner, a "you have new mail" pulse — stays on regardless.

## Transitions vs. animations (the v4 answer)

Tailwind exposes both. The distinction:

- **Transitions** (`transition-*`) interpolate between two states (hover, focus, class change). Cheap, interruptible, the default for UI.
- **Animations** (`animate-*`) run a keyframe sequence regardless of state. Use when you need entry/exit motion or a looping effect.

Most UI work is transitions. Reserve animations for entrance/exit and looping decorations.

```html
<button class="transition-transform duration-200 hover:scale-105">
  Press me
</button>

<div class="animate-slide-up">
  Hello
</div>
```

## The full motion utility list

<CodeTabs tabs={[ { label: "Transitions", lang: "html", code: ` transition-all /* all properties */ transition-colors /* color, bg, border */ transition-opacity transition-shadow transition-transform  duration-75 / 100 / 150 / 200 / 300 / 500 / 700 / 1000  ease-linear ease-in ease-out ease-in-out  delay-75 / 100 / 150 / 200 / 300 / 500 / 700 / 1000`, }, { label: "Transforms", lang: "html", code: ` scale-0 / 50 / 75 / 90 / 95 / 100 / 105 / 110 / 125 / 150 scale-x-* scale-y-*  rotate-0 / 1 / 2 / 3 / 6 / 12 / 45 / 90 / 180 -rotate-* /* negative */  translate-x-* translate-y-* /* spacing scale */ translate-x-px translate-x-full translate-x-1/2  skew-x-* skew-y-*`, }, { label: "Common patterns", lang: "html", code: ` <a class="transition-transform duration-200 hover:-translate-y-0.5">  <div class="animate-fade-in">  <button class="transition-transform active:scale-95">  <a class="transition-colors duration-300 hover:text-orange-500">  <div class="animate-slide-in-r">`, }, ]} caption="Reference for Tailwind v4 motion utilities." height={400} />

## Patterns that work in v4

Three patterns we reach for constantly:

### 1. The lift-on-hover

```html
<a class="block rounded-xl border p-6 transition-all duration-200 hover:-translate-y-0.5 hover:shadow-lg">
```

Two changes (a `-translate-y` transform plus a box-shadow change), 200ms, all properties. The shadow change is what sells the depth.

### 2. The press-state button

```html
<button class="rounded-lg bg-red-500 px-4 py-2 text-white transition-all duration-150 hover:bg-red-600 active:scale-95">
```

`active:scale-95` gives a tactile press without a JS handler. The browser handles `:active`; Tailwind handles the scale.

### 3. The staggered list

For list entries that fade in with a stagger, use `animation-delay` via inline style:

```html
{items.map((item, i) => (
  <div class="animate-fade-in" style={`animation-delay:${i * 50}ms`}>
    {item.text}
  </div>
))}
```

50ms is the sweet spot — visible cascade, no awkward delay before the last item. One catch: the `fade-in` token defined earlier has no fill mode, so a delayed item shows at full opacity until its delay elapses. Give the token a `both` (or `backwards`) fill mode, as `shake` and `slide-up` already have, so each item stays hidden until its turn.

## Rendering Tailwind animations to MP4

Tailwind animations are CSS animations under the hood — they render to MP4 through the [HyperFrames pipeline](/tools/html-to-video) just like any other CSS animation. The one caveat: `motion-reduce` will follow the rendering browser's preference, so set the rendering environment's `prefers-reduced-motion` to `no-preference` for production renders.

[Open the playground](/playground), drop in a Tailwind component, scrub through the motion timeline.

---

# Animated comparison tables that convert (pricing & feature grids)

URL: https://hyperframes.video/blog/animated-comparison-table
Published: 2026-04-22T09:00:00.000Z
Tags: css, table, animation, pricing, tutorial
Author: marcus-okafor

The comparison table is the highest-stakes piece of typography on most SaaS pages. It is also the most-ignored by motion design. Most pricing pages ship a static three-column table and call it done — and lose conversions because the table reads like a spreadsheet instead of a recommendation.

A bit of motion, applied carefully, fixes this. Not "make the table dance" — make the table draw the eye to the right column, reward the scroll, and make the differences between plans feel like a story instead of a grid.

## What "animated" should mean here

Three jobs, no more:

1. **Reveal in cascade** when the table scrolls into view.
Each row appears 80ms after the one above it. 2. **Highlight the recommended column** with a subtle pulse on the price. 3. **Animate the checkmarks** drawing themselves on first paint — a fast, decisive stroke. Anything beyond that — color-shifting rows, sliding tooltips, parallaxing prices — undermines the table's job, which is to be readable. ## The cascade reveal A staggered fade-in down the rows. Each row uses the same animation; the `animation-delay` is what differs. <InlineSandbox html={`<!doctype html> <html><head><style> body{margin:0;background:#f6f5f1;color:#0a0a0a;display:grid;place-items:center;height:100vh;font-family:ui-sans-serif,system-ui;} table{border-collapse:collapse;width:520px;background:#fff;border-radius:12px;overflow:hidden;box-shadow:0 12px 32px rgba(0,0,0,.08);} th,td{padding:14px 16px;text-align:left;font-size:14px;border-bottom:1px solid #ece9e0;} th{font:600 11px ui-monospace,monospace;letter-spacing:.18em;text-transform:uppercase;color:#6b6862;background:#f9f8f4;} .col-rec{background:rgba(255,59,31,.05);} .price{font-weight:700;font-size:18px;} .price-rec{color:#ff3b1f;} .row{opacity:0;animation:in .4s ease-out forwards;} @keyframes in { to { opacity: 1; transform: translateY(0); } } .row{transform:translateY(8px);} .row:nth-child(1){animation-delay:0ms;} .row:nth-child(2){animation-delay:80ms;} .row:nth-child(3){animation-delay:160ms;} .row:nth-child(4){animation-delay:240ms;} .row:nth-child(5){animation-delay:320ms;} .check{width:14px;height:14px;} .dim{color:#bbb;} </style></head><body> <table> <thead><tr class="row"><th></th><th>Starter</th><th class="col-rec">Pro</th><th>Team</th></tr></thead> <tbody> <tr class="row"><td>Price</td><td class="price">$0</td><td class="col-rec price price-rec">$29</td><td class="price">$99</td></tr> <tr class="row"><td>Renders / mo</td><td>100</td><td class="col-rec">5,000</td><td>Unlimited</td></tr> <tr class="row"><td>Team seats</td><td>1</td><td class="col-rec">5</td><td>20</td></tr> <tr 
class="row"><td>API access</td><td class="dim">—</td><td class="col-rec">✓</td><td>✓</td></tr> </tbody> </table> </body></html>`} height={340} caption="Cascade reveal: each row 80ms after the previous. The Pro column tints subtly." /> The animation runs once on mount. Don't loop it. A pricing table that keeps redrawing itself feels like a slot machine. ## The highlighted column The "recommended" column gets three signals: 1. **A subtle tint** behind the cells (5% accent color). 2. **A "POPULAR" pill** above the column header. 3. **The price in the accent color**, slightly larger. Combined, the eye lands on that column first, every time. Without them, viewers default to the cheapest column, which is not what you want. ## The checkmark animation For feature-grid rows, a check or X per cell. Animate the checks drawing on entry: ```html <svg viewBox="0 0 14 14" class="check"> <path d="M3 7 L6 10 L11 4" stroke="currentColor" stroke-width="2" fill="none" stroke-linecap="round" stroke-linejoin="round" stroke-dasharray="14" stroke-dashoffset="14"> <animate attributeName="stroke-dashoffset" from="14" to="0" dur="0.3s" begin="0.4s" fill="freeze"/> </path> </svg> ``` Set the `begin` per row so the checks draw in cascade with the row reveals. Three details that matter: - **`stroke-linecap: round`** — sharp ends read as broken pixels at small sizes. - **`begin` delay aligned to row delay + 100ms** — the row settles, then the checks draw. - **No animation on the X mark** — only the positive answer animates. The X is just present. 
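The per-row `begin` values can be computed rather than hand-written. The following is a minimal sketch, assuming the 80ms row stagger from the cascade reveal and the 100ms offset from the list above. `checkBegin` and `checkSvg` are hypothetical helper names for illustration, not part of any library or of the HyperFrames CLI.

```typescript
// Hypothetical helpers: names and constants are illustrative assumptions.
const ROW_STAGGER_MS = 80;   // delay between consecutive row reveals
const CHECK_OFFSET_MS = 100; // extra wait so the row settles before its check draws

// SMIL `begin` value for the check in a given zero-based row.
function checkBegin(rowIndex: number): string {
  const ms = rowIndex * ROW_STAGGER_MS + CHECK_OFFSET_MS;
  return `${(ms / 1000).toFixed(2)}s`;
}

// Build the self-drawing check SVG for one row, reusing the markup shown above.
function checkSvg(rowIndex: number): string {
  return [
    `<svg viewBox="0 0 14 14" class="check">`,
    `  <path d="M3 7 L6 10 L11 4" stroke="currentColor" stroke-width="2" fill="none"`,
    `        stroke-linecap="round" stroke-linejoin="round"`,
    `        stroke-dasharray="14" stroke-dashoffset="14">`,
    `    <animate attributeName="stroke-dashoffset" from="14" to="0"`,
    `             dur="0.3s" begin="${checkBegin(rowIndex)}" fill="freeze"/>`,
    `  </path>`,
    `</svg>`,
  ].join("\n");
}
```

Keeping the two timing constants in one place means a change to the row stagger carries over to the checks without editing each `begin` attribute by hand.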
<VariableKnobs html={`<style> body{margin:0;background:{{$BG}};color:{{$FG}};display:grid;place-items:center;height:380px;font-family:ui-sans-serif,system-ui;} table{border-collapse:collapse;width:480px;background:{{$CARD}};border-radius:12px;overflow:hidden;box-shadow:0 12px 32px rgba(0,0,0,.08);} th,td{padding:12px 14px;text-align:left;font-size:13px;border-bottom:1px solid rgba(127,127,127,.15);} th{font:600 10px ui-monospace,monospace;letter-spacing:.18em;text-transform:uppercase;color:rgba(127,127,127,.8);} .rec{background:color-mix(in srgb,{{$ACCENT}} 6%,transparent);} .price{font-weight:700;font-size:18px;} .price-rec{color:{{$ACCENT}};font-size:20px;} .pill{position:absolute;top:-10px;right:30px;background:{{$ACCENT}};color:#fff;padding:3px 9px;border-radius:999px;font:700 9px ui-monospace,monospace;letter-spacing:.15em;text-transform:uppercase;} .wrap{position:relative;} .dim{color:rgba(127,127,127,.5);} </style> <div class="wrap"> <table> <thead><tr><th></th><th>{{$P1}}</th><th class="rec">{{$P2}}</th><th>{{$P3}}</th></tr></thead> <tbody> <tr><td>Price</td><td class="price">{{$PR1}}</td><td class="rec price price-rec">{{$PR2}}</td><td class="price">{{$PR3}}</td></tr> <tr><td>Renders / month</td><td>100</td><td class="rec">5,000</td><td>Unlimited</td></tr> <tr><td>Team seats</td><td>1</td><td class="rec">5</td><td>20</td></tr> <tr><td>Priority support</td><td class="dim">—</td><td class="rec">✓</td><td>✓</td></tr> </tbody> </table> <div class="pill">popular</div> </div>`} knobs={[ { name: "P1", label: "Plan 1", default: "Starter" }, { name: "P2", label: "Plan 2 (rec)", default: "Pro" }, { name: "P3", label: "Plan 3", default: "Team" }, { name: "PR1", label: "Price 1", default: "$0" }, { name: "PR2", label: "Price 2", default: "$29" }, { name: "PR3", label: "Price 3", default: "$99" }, { name: "ACCENT", label: "Accent", type: "color", default: "#ff3b1f" }, { name: "CARD", label: "Card bg", type: "color", default: "#ffffff" }, { name: "BG", label: "Page bg", 
type: "color", default: "#f6f5f1" }, { name: "FG", label: "Text", type: "color", default: "#0a0a0a" } ]} /> ## The mobile concession Comparison tables fail on mobile. Three rescue patterns, in increasing complexity: - **Horizontal scroll** with the first column pinned. Works for tables under five plans. - **Column-per-card** stacked vertically. Each plan becomes its own card. - **Toggle between plans** with a segmented control above a single column view. For pricing pages, pattern 2 (stacked cards) converts best in our testing. The user scrolls top-down, sees one plan at a time, and the comparison happens in working memory. ## Rendering to MP4 (for ads) A pricing-table video is one of the highest-ROI social ads for SaaS. Render the cascade reveal at `1080×1920` (Reels), `1080×1080` (in-feed), and `1920×1080` (YouTube pre-roll) from the same template. Loop the reveal every 4 seconds with a 1-second hold at the end. The viewer should see the full table at rest for at least one full second before the loop restarts. ## What not to animate A long list. Don't: - Animate the feature labels (left column). They are the lookup column; they should be stable. - Add hover animations on individual cells. Cells aren't actionable. - Animate "Buy" buttons separately. They are part of the table; treat them as cells. - Add scroll-tied parallax to anything in the table. Tables are not heroes. A comparison table is functional typography. Motion supports the function, never decorates it. [Open the playground](/playground), drop in your pricing tiers, render the ad cuts. --- # Slack-style notification toast animation in CSS URL: https://hyperframes.video/blog/notification-toast-animation Published: 2026-04-20T09:00:00.000Z Tags: css, toast, notification, ui, tutorial Author: ren-park The notification toast is one of the most-implemented UI elements and one of the most-misjudged. 
Most toasts are either too aggressive (sliding in from a corner, blocking content, demanding action) or too quiet (appearing without motion, easy to miss). The Slack toast — discreet slide-in, sits for a few seconds, fades out, hover-pauses — has been the gold standard since 2015 for good reasons. Here is how to build it from scratch in CSS, including the details that make the difference between "a notification" and "a Slack-style notification." ## The four motion beats A good toast has four: 1. **Slide-in** (200–300ms, ease-out). From off-screen edge, into final position. Decisive. 2. **Settle** (50ms, no overshoot for toasts). Quieter than a button press. 3. **Hover-pause** (state-driven). When the cursor enters, freeze the dismiss timer. 4. **Slide-out** (200ms, ease-in). Reverse direction, slightly faster than the entry. The total runtime — from spawn to dismissed — is 4 to 6 seconds for an info-level toast, 8+ for an action-required toast. <InlineSandbox html={`<!doctype html><html><head><style> body{margin:0;background:#0a0a0a;color:#fff;height:100vh;font-family:ui-sans-serif,system-ui;padding:24px;box-sizing:border-box;display:grid;place-items:end;} .toast{position:relative;background:#fff;color:#0a0a0a;border-radius:10px;padding:14px 16px 14px 14px;box-shadow:0 12px 32px rgba(0,0,0,.18);display:grid;grid-template-columns:24px 1fr;gap:12px;align-items:start;max-width:340px;animation:in .25s cubic-bezier(.2,.7,.2,1) both,settle 2.5s linear forwards 1s;} @keyframes in { from { transform: translateX(120%); opacity: 0; } to { transform: translateX(0); opacity: 1; } } @keyframes settle { 0%, 90% { transform: translateX(0); opacity: 1; } 100% { transform: translateX(120%); opacity: 0; } } .icon{width:24px;height:24px;border-radius:50%;background:#1f8a5b;display:grid;place-items:center;color:#fff;font-weight:700;font-size:14px;} .title{font-weight:600;font-size:14px;margin-bottom:2px;} .body{font-size:13px;color:#444;line-height:1.4;} 
.dismiss{position:absolute;top:8px;right:8px;background:none;border:0;color:#999;font-size:14px;cursor:pointer;line-height:1;} .bar{position:absolute;bottom:0;left:0;right:0;height:2px;background:#1f8a5b;animation:bar 3.5s linear forwards;border-radius:0 0 10px 10px;transform-origin:left;} @keyframes bar { from { transform: scaleX(1); } to { transform: scaleX(0); } } </style></head><body> <div class="toast"> <div class="icon">✓</div> <div><div class="title">Render queued</div><div class="body">5 of 5 variants will be ready in ~2 minutes.</div></div> <button class="dismiss">×</button> <div class="bar"></div> </div> </body></html>`} height={240} caption="Toast slide-in, settle, dismiss bar, slide-out. Loops for demo." />

## The slide direction

Toasts spawn from one of four corners. Bottom-right and top-right are the conventional defaults; both work, but the bottom is slightly better because the eye is more often there.

Direction of motion: **always from off-screen toward the center of the viewport**. Bottom-right toast slides up-and-left? No — it slides in from the right edge. Bottom-center toast slides up. Top-right toast slides from the right. The motion vector points toward where it will rest, with the off-screen origin matching the closest edge.

## The progress bar (optional but high-leverage)

A 2px bar at the bottom of the toast that drains over the auto-dismiss duration. Two reasons it's worth adding:

1. **Feedback.** The user knows the toast will dismiss itself. Without the bar, they don't know if they need to act.
2. **Pause indicator.** When the mouse enters, the bar's animation pauses. The user sees that hovering keeps the toast alive.

Implement with `transform: scaleX()` and `transform-origin: left` so the left end stays pinned and the bar shrinks toward it as it drains.
```css .toast-bar { transform-origin: left; animation: drain 4s linear forwards; } .toast:hover .toast-bar { animation-play-state: paused; } @keyframes drain { to { transform: scaleX(0); } } ``` ## Stacking When two toasts coexist, the second pushes the first up (or down, depending on origin). The position transition is a 200ms ease-out on `transform: translateY()`. The rule for stack height: max 3 toasts visible. Beyond that, older toasts dismiss themselves immediately to make room. A stack of 8 toasts is a UX failure regardless of how smoothly they animate. <VariableKnobs html={`<style>body{margin:0;background:{{$BG}};color:#fff;height:340px;font-family:ui-sans-serif,system-ui;padding:24px;box-sizing:border-box;display:grid;place-items:end;} .toast{position:relative;background:{{$CARD}};color:{{$FG}};border-radius:{{$RADIUS}}px;padding:14px 16px 14px 14px;box-shadow:0 12px 32px rgba(0,0,0,.18);display:grid;grid-template-columns:24px 1fr;gap:12px;max-width:340px;animation:in .25s cubic-bezier(.2,.7,.2,1) both,settle 3.5s linear forwards 0.8s;} @keyframes in { from { transform: translateX(120%); opacity: 0; } to { transform: translateX(0); opacity: 1; } } @keyframes settle { 0%,85%{transform:translateX(0);opacity:1;} 100%{transform:translateX(120%);opacity:0;} } .icon{width:24px;height:24px;border-radius:50%;background:{{$ACCENT}};display:grid;place-items:center;color:#fff;font-weight:700;font-size:14px;} .title{font-weight:600;font-size:14px;} .body{font-size:13px;color:rgba(0,0,0,.6);line-height:1.4;margin-top:2px;} .bar{position:absolute;bottom:0;left:0;right:0;height:2px;background:{{$ACCENT}};transform-origin:left;animation:bar 4.2s linear forwards;border-radius:0 0 {{$RADIUS}}px {{$RADIUS}}px;} @keyframes bar { to { transform: scaleX(0); } } </style> <div class="toast"><div class="icon">{{$ICON}}</div><div><div class="title">{{$TITLE}}</div><div class="body">{{$BODY}}</div></div><div class="bar"></div></div>`} knobs={[ { name: "TITLE", label: "Title", 
default: "Render queued" }, { name: "BODY", label: "Body", default: "5 of 5 variants will be ready in ~2 minutes." }, { name: "ICON", label: "Icon", default: "✓" }, { name: "ACCENT", label: "Accent", type: "color", default: "#1f8a5b" }, { name: "CARD", label: "Card bg", type: "color", default: "#ffffff" }, { name: "FG", label: "Text color", type: "color", default: "#0a0a0a" }, { name: "BG", label: "Page bg", type: "color", default: "#0a0a0a" }, { name: "RADIUS", label: "Radius", type: "number", default: "10", min: 0, max: 24, step: 1 } ]} />

## Toast severity levels

Four levels, each with a distinct visual treatment:

| Level | Icon | Accent | Duration | Use |
|---|---|---|---|---|
| Info | ⓘ | Brand blue | 4s | Background events, save confirmations |
| Success | ✓ | Green | 3s | Action completed |
| Warning | ⚠ | Amber | 6s | Recoverable problem, no action needed |
| Error | ✕ | Red | Sticky (manual dismiss) | Action failed, requires acknowledgement |

The "sticky on error" rule is the one most apps get wrong. An error toast that disappears in 4 seconds is information the user often misses.

## Accessibility, briefly

- **`role="status"`** for info/success.
- **`role="alert"`** for warning/error — screen readers interrupt to read these.
- **`aria-live="polite"`** on the toast container.
- **Honor `prefers-reduced-motion`** — replace slide with a 100ms fade.

These are five lines of code that make the toast usable for ~15% more of your users.

## Rendering to MP4 (for onboarding videos)

The toast pattern lifts directly into the [render pipeline](/tools/html-to-video). Two use cases:

- **Onboarding walkthroughs** — render a toast sequence showing the user what notifications they'll see.
- **Marketing site demos** — a fake toast on a hero shot, demoing the product without requiring a real login.

Set the loop to 4 seconds (in + settle + out + 500ms gap) and the toast reads as ambient motion on the hero.
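The 4-second loop budget can be turned into keyframe percentages mechanically. The sketch below is one way to do that, assuming a 250ms slide-in and a 200ms slide-out (values taken from the motion beats earlier in the post) plus the 500ms gap. The `ToastLoop` shape and the `toast-loop` keyframe name are hypothetical and exist only for this example.

```typescript
// Hypothetical shape for the loop budget; all durations are in milliseconds.
interface ToastLoop {
  loopMs: number; // full loop length, e.g. 4000
  inMs: number;   // slide-in duration
  outMs: number;  // slide-out duration
  gapMs: number;  // empty pause before the loop restarts
}

// Emit a single @keyframes rule that covers in, hold, out, and gap.
function toastLoopKeyframes({ loopMs, inMs, outMs, gapMs }: ToastLoop): string {
  const holdMs = loopMs - inMs - outMs - gapMs; // whatever time is left is the hold
  if (holdMs < 0) throw new Error("loop is too short for the in/out/gap beats");
  // Percentage of the loop at a given time, trimmed to two decimals.
  const pct = (ms: number) => `${+((ms / loopMs) * 100).toFixed(2)}%`;
  const off = "transform: translateX(120%); opacity: 0;";
  const on = "transform: translateX(0); opacity: 1;";
  return [
    "@keyframes toast-loop {",
    `  0% { ${off} }`,
    `  ${pct(inMs)} { ${on} }`,
    `  ${pct(inMs + holdMs)} { ${on} }`,
    `  ${pct(inMs + holdMs + outMs)}, 100% { ${off} }`,
    "}",
  ].join("\n");
}

const css = toastLoopKeyframes({ loopMs: 4000, inMs: 250, outMs: 200, gapMs: 500 });
```

With these inputs the hold works out to 3,050ms. A single looping rule like this drops the separate ease-out and ease-in curves; if those matter for the render, `animation-timing-function` can be set inside individual keyframes.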
[Open the playground](/playground), drop the toast in, render the onboarding clip.

---

# Generative mesh gradients in pure CSS (Stripe-style backgrounds)

URL: https://hyperframes.video/blog/generative-mesh-gradients
Published: 2026-04-18T09:00:00.000Z
Tags: css, gradient, generative, mesh, tutorial
Author: marcus-okafor

The "mesh gradient" — those painterly, multi-color backgrounds you see on Stripe, Linear, and every well-designed SaaS hero of the last three years — looks like it requires a design tool. It doesn't. Three radial gradients, a heavy blur, an SVG noise overlay, and you have one. Add a deterministic seed and you have an infinite supply of unique backgrounds for variant videos.

This is the engineering build: pure CSS, no libraries, reproducible across renders.

## Why mesh gradients work

The eye reads two-color linear gradients as "design from 2014." It reads three-or-more-color radial gradients with blur as "this product probably has a Series B." The shift is partly cultural (the look became the standard around 2019) and partly visual — multi-color soft fields feel hand-painted because real painted gradients are not linear.

The look has four components:

1. **3–5 radial overlays**, each in a different brand color.
2. **Positioned in non-grid coordinates** — never `50% 50%`, always something like `27% 13%`.
3. **Heavy `filter: blur()`, 40–80px,** to smear the overlays together.
4. **SVG noise overlay at 8–12% opacity** to break banding.

That's the full recipe.
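The first three ingredients of the recipe can be sketched as a small string builder. This is an illustration under stated assumptions, not the HyperFrames implementation: `MeshBlob`, `meshBackground`, and `meshStyle` are hypothetical names, and the default blur and base color are placeholder values.

```typescript
// Hypothetical description of one radial overlay; x, y, and fade are percentages.
interface MeshBlob {
  color: string; // any CSS color
  x: number;     // horizontal center, 0-100
  y: number;     // vertical center, 0-100
  fade: number;  // where the color reaches transparent, 0-100
}

// Join the radial overlays into one `background` value, with the base color last.
function meshBackground(blobs: MeshBlob[], base = "#0a0a0a"): string {
  const layers = blobs.map(
    (b) => `radial-gradient(at ${b.x}% ${b.y}%, ${b.color} 0%, transparent ${b.fade}%)`
  );
  return [...layers, base].join(", ");
}

// Full declaration block for the mesh layer: overlays plus the heavy blur.
function meshStyle(blobs: MeshBlob[], blurPx = 60, base = "#0a0a0a"): string {
  return `background: ${meshBackground(blobs, base)}; filter: blur(${blurPx}px);`;
}

const style = meshStyle([
  { color: "#ff3b1f", x: 27, y: 13, fade: 50 },
  { color: "#7b2cff", x: 71, y: 83, fade: 50 },
  { color: "#1f5fff", x: 19, y: 71, fade: 50 },
]);
```

The noise overlay is left out here because it is a separate SVG element layered on top rather than part of the gradient declaration.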
<InlineSandbox html={`<!doctype html><html><head><style> :root { --c1:#ff3b1f; --c2:#ff9b00; --c3:#7b2cff; --c4:#1f5fff; --c5:#1f8a5b; } body{margin:0;height:100vh;overflow:hidden;background:#0a0a0a;} .mesh{position:fixed;inset:-10%;background: radial-gradient(at 27% 13%, var(--c1) 0%, transparent 50%), radial-gradient(at 81% 22%, var(--c2) 0%, transparent 45%), radial-gradient(at 71% 83%, var(--c3) 0%, transparent 50%), radial-gradient(at 19% 71%, var(--c4) 0%, transparent 50%), radial-gradient(at 50% 50%, var(--c5) 0%, transparent 40%), #0a0a0a; filter:blur(60px) saturate(1.1); animation:m 20s ease-in-out infinite alternate; } @keyframes m { 0% { transform: scale(1.1) translate(0,0) rotate(0deg); } 100% { transform: scale(1.3) translate(-3%,2%) rotate(8deg); } } .noise{position:fixed;inset:0;opacity:.10;mix-blend-mode:overlay;pointer-events:none;} .hd{position:relative;height:100vh;display:grid;place-items:center;color:#fff;font:600 56px ui-sans-serif,system-ui;letter-spacing:-.02em;mix-blend-mode:plus-lighter;} </style></head><body> <div class="mesh"></div> <svg class="noise"><filter id="n"><feTurbulence type="fractalNoise" baseFrequency=".9" numOctaves="2"/></filter><rect width="100%" height="100%" filter="url(#n)"/></svg> <div class="hd">Series A — closed</div> </body></html>`} height={340} caption="Five-color mesh gradient with noise. Drifts on a 20-second loop." /> ## The seeding trick — reproducible gradients For a video pipeline that renders N variants, you want each variant to have a unique gradient but in a repeatable way. The hack: use the variant's ID as a numeric seed, derive the gradient positions and rotations from it. 
```ts
// FNV-1a: a small, stable string hash → uint32 (any stable string hash works).
function hash(s: string): number {
  let h = 0x811c9dc5;
  for (let i = 0; i < s.length; i++) {
    h ^= s.charCodeAt(i);
    h = Math.imul(h, 0x01000193);
  }
  return h >>> 0;
}

function gradientForSeed(seed: string) {
  // A uint32 holds only four bytes and JS shift counts wrap at 32, so shifting by
  // 32, 40, or 48 would reuse earlier bytes. Hash one key per field instead.
  const pick = (key: string, range: number) => hash(`${seed}:${key}`) % range;
  return {
    c1Pos: { x: 10 + pick("c1x", 60), y: 10 + pick("c1y", 60) },
    c2Pos: { x: 30 + pick("c2x", 60), y: 10 + pick("c2y", 60) },
    c3Pos: { x: 30 + pick("c3x", 60), y: 40 + pick("c3y", 60) },
    rotate: pick("rotate", 30) - 15,
  };
}
```

Plug those numbers into the CSS variables. Every variant with the same seed renders the same gradient; different seeds give different gradients.

<VariableKnobs html={`<style> body{margin:0;height:380px;overflow:hidden;background:#0a0a0a;position:relative;} .mesh{position:absolute;inset:-10%;background: radial-gradient(at {{$P1X}}% {{$P1Y}}%, {{$C1}} 0%, transparent 50%), radial-gradient(at {{$P2X}}% {{$P2Y}}%, {{$C2}} 0%, transparent 45%), radial-gradient(at {{$P3X}}% {{$P3Y}}%, {{$C3}} 0%, transparent 50%), {{$BG}}; filter:blur({{$BLUR}}px) saturate({{$SAT}}); } .noise{position:absolute;inset:0;opacity:.08;mix-blend-mode:overlay;pointer-events:none;} .lbl{position:absolute;left:24px;bottom:24px;color:#fff;font:600 36px ui-sans-serif,system-ui;letter-spacing:-.02em;} </style> <div class="mesh"></div> <svg class="noise"><filter id="n"><feTurbulence type="fractalNoise" baseFrequency=".9" numOctaves="2"/></filter><rect width="100%" height="100%" filter="url(#n)"/></svg> <div class="lbl">{{$LABEL}}</div>`} knobs={[ { name: "LABEL", label: "Label", default: "Q2 launch" }, { name: "C1", label: "Color 1", type: "color", default: "#ff3b1f" }, { name: "C2", label: "Color 2", type: "color", default: "#7b2cff" }, { name: "C3", label: "Color 3", type: "color", default: "#1f5fff" }, { name: "BG", label: "Background", type: "color", default: "#0a0a0a" }, { name: "P1X", label: "C1 X%", type: "number", default: "27", min: 0, max: 100, step: 1 }, { name: "P1Y", label: "C1 Y%", type: "number", default: "13", min: 0, max: 100, step: 1 }, { name: "P2X", label: "C2 X%", type: "number", default: "81", min: 0,
max: 100, step: 1 }, { name: "P2Y", label: "C2 Y%", type: "number", default: "22", min: 0, max: 100, step: 1 }, { name: "P3X", label: "C3 X%", type: "number", default: "71", min: 0, max: 100, step: 1 }, { name: "P3Y", label: "C3 Y%", type: "number", default: "83", min: 0, max: 100, step: 1 }, { name: "BLUR", label: "Blur (px)", type: "number", default: "60", min: 0, max: 120, step: 2 }, { name: "SAT", label: "Saturation", default: "1.1" } ]} />

## The animation: drift, don't dance

The motion on a mesh gradient is the difference between "designed background" and "Windows screensaver." Two rules:

1. **Use `transform`, not `background-position`.** Animating `background-position` makes the gradients visibly slide; animating `transform` on the gradient container makes the whole field drift, which is what you want.
2. **20+ second loops only.** A 5-second mesh animation reads as nervous. A 30-second loop reads as ambient.

The animation we use across landing pages:

```css
.mesh {
  animation: drift 24s ease-in-out infinite alternate;
}
@keyframes drift {
  0% { transform: scale(1.1) translate(0, 0) rotate(0deg); }
  100% { transform: scale(1.3) translate(-3%, 2%) rotate(6deg); }
}
```

The `alternate` direction is critical. Without it, the gradient snaps back at the end of each loop, which looks like a glitch. With `alternate`, the motion is a continuous breathe-in / breathe-out.

## The noise overlay (the unsung hero)

A mesh gradient without noise on top renders fine in the browser and looks washed-out in MP4. The video codec bands the smooth gradient regions; noise breaks the bands.

```html
<svg class="noise">
  <filter id="n">
    <feTurbulence type="fractalNoise" baseFrequency=".9" numOctaves="2"/>
  </filter>
  <rect width="100%" height="100%" filter="url(#n)"/>
</svg>
```

`mix-blend-mode: overlay` blends the noise into the gradient instead of sitting on top of it. 8–12% opacity is the right range — visible texture without becoming the dominant element.
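For a pipeline that generates its markup, the overlay and its blend settings can be emitted together. The helper below is a rough sketch under assumptions: `noiseOverlay` is a hypothetical name, the inline style mirrors the `.noise` rule used in the demos, and the clamp simply enforces the 8–12% range described above.

```typescript
// Hypothetical helper: returns the noise overlay SVG with its blend settings inline.
function noiseOverlay(opacity = 0.1, filterId = "n"): string {
  // Keep the texture inside the 8-12% range so it never dominates the gradient.
  const clamped = Math.min(0.12, Math.max(0.08, opacity));
  const style =
    `position:fixed;inset:0;width:100%;height:100%;` +
    `opacity:${clamped};mix-blend-mode:overlay;pointer-events:none;`;
  return [
    `<svg class="noise" style="${style}">`,
    `  <filter id="${filterId}">`,
    `    <feTurbulence type="fractalNoise" baseFrequency=".9" numOctaves="2"/>`,
    `  </filter>`,
    `  <rect width="100%" height="100%" filter="url(#${filterId})"/>`,
    `</svg>`,
  ].join("\n");
}
```

The `filterId` parameter is there because SVG filter ids are document-wide; two overlays on one page that both use `id="n"` would collide.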
## Rendering to MP4

The mesh gradient is pure CSS, so it renders through the [HyperFrames pipeline](/tools/html-to-video) without special handling. The render is deterministic because the gradient positions are derived from a seed, not from `Math.random()`.

Three practical sizes:

- **1080×1080** — Instagram post background.
- **1080×1920** — Reels / Shorts background.
- **2560×1440** — desktop wallpaper drop or YouTube thumbnail.

Render once per seed; cache the output; reuse across the campaign.

## When mesh gradients age

The look will eventually go the way the "blurry purple-pink" look did. The hedge: use brand-specific colors rather than the default "purple-pink-blue" palette. The structure (mesh + blur + noise) is durable; the specific palette is what dates.

[Open the playground](/playground), seed your variant IDs, render a background per campaign.

---

# Skeleton loaders that don't feel cheap (CSS + timing)

URL: https://hyperframes.video/blog/skeleton-loader-animation
Published: 2026-04-16T09:00:00.000Z
Tags: css, skeleton, loading, ui, tutorial
Author: ren-park

The skeleton loader replaced the spinner around 2018 and the entire web got slightly less anxious. Facebook deserves the credit; the idea was simple — show the shape of the content while it loads — and the upgrade in perceived performance was real.

Most skeleton implementations are still bad. Either the skeleton shapes don't match the content that replaces them (causing layout jolt), or the shimmer animation is too fast (looks frantic), or the colors are too gray (looks like a 404). Here's how to do skeletons that read as design.

## What a good skeleton actually is

Three principles, each non-negotiable:

1. **The skeleton matches the final content's geometry exactly.** Same height, same width, same border-radius. When the real content arrives, nothing reflows.
2. **The shimmer is slow.** 1.5 to 2.5 seconds per cycle. Anything faster reads as urgent — and "loading" should not feel urgent.
3.
**The colors are part of your palette.** Not gray-on-gray. A subtle tint of your card background works; pure `#e0e0e0` looks like a Bootstrap demo. Get those three right and the skeleton stops being a placeholder and starts being a part of the design. <InlineSandbox html={`<!doctype html><html><head><style> body{margin:0;background:#f6f5f1;color:#0a0a0a;display:grid;place-items:center;height:100vh;font-family:ui-sans-serif,system-ui;} .card{width:340px;background:#fff;border-radius:14px;padding:18px;box-shadow:0 8px 24px rgba(0,0,0,.06);} .skel{background:linear-gradient(90deg,#eee 0%,#f5f5f5 50%,#eee 100%);background-size:200% 100%;animation:sh 2s ease-in-out infinite;border-radius:6px;} .row{display:flex;gap:12px;align-items:center;margin-bottom:14px;} .avatar{width:40px;height:40px;border-radius:50%;} .l1{height:14px;width:60%;margin-bottom:6px;} .l2{height:10px;width:40%;} .line{height:12px;margin-bottom:8px;} .line.last{width:80%;} @keyframes sh { 0%{background-position:200% 0;} 100%{background-position:-200% 0;} } </style></head><body> <div class="card"> <div class="row"> <div class="skel avatar"></div> <div style="flex:1;"><div class="skel l1"></div><div class="skel l2"></div></div> </div> <div class="skel line"></div> <div class="skel line"></div> <div class="skel line last"></div> </div> </body></html>`} height={280} caption="A user-card skeleton — avatar, two name lines, three body lines. The shimmer is slow and warm." /> ## The shimmer technique The shimmer is a moving gradient. The trick is making it move across `background-position`, not by translating an overlay element. The single-element technique: ```css .skeleton { background: linear-gradient(90deg, var(--skel-base) 0%, var(--skel-hl) 50%, var(--skel-base) 100%); background-size: 200% 100%; animation: shimmer 2s ease-in-out infinite; } @keyframes shimmer { 0% { background-position: 200% 0; } 100% { background-position: -200% 0; } } ``` The `--skel-base` and `--skel-hl` are the only variables. 
For light themes: ```css --skel-base: #ecebe5; --skel-hl: #f5f4ee; ``` Warm enough to match cream backgrounds, not gray. For dark themes, invert: `#1a1a1a` base, `#252525` highlight. ## Skeleton shapes vs. real content The killer detail is dimension matching. If the real content is a 14px-tall name row, the skeleton row is 14px tall — not 12, not 16. If the avatar is `40px` round, the skeleton avatar is `40px` round. Take the real component and replace every `<text>` with a skeleton `<div>` of the same size: <CodeTabs tabs={[ { label: "Real", lang: "tsx", code: `<div className="card"> <div className="row"> <div> <div className="name">{user.name}</div> <div className="title">{user.title}</div> </div> </div> <p className="bio">{user.bio}</p> </div>`, }, { label: "Skeleton", lang: "tsx", code: `<div className="card"> <div className="row"> <div> </div> </div> </div>`, }, ]} caption="Same component, real vs. skeleton — every element matched." /> ## Where the shimmer goes wrong Three common failures: 1. **Animation too fast.** 600ms loops feel like a spinner. 2-second loops feel like ambient breathing. 2. **Animation too contrasty.** A bright white highlight on a dark gray base looks like a flashlight. Keep the highlight 10–20% brighter than the base; no more. 3. **Skeleton everywhere.** Skeletoning the entire viewport feels worse than a spinner because the eye tries to read every shape. Skeleton only the primary content; let secondary UI stay in its default state. ## The "stop animating" timing A skeleton that has been on screen for over 3 seconds should switch from shimmering to static. The shimmer implies "data is coming"; if data isn't coming after 3 seconds, something is wrong, and continuing to shimmer feels like a lie. ```js setTimeout(() => container.classList.add('paused-shimmer'), 3000); ``` ```css .paused-shimmer .skeleton { animation: none; opacity: .85; } ``` After 6 seconds with no data, show an error state. Not a longer skeleton. Not a spinner. 
An error with a retry button.

<VariableKnobs html={`<style>body{margin:0;background:{{$BG}};color:{{$FG}};display:grid;place-items:center;height:340px;font-family:ui-sans-serif,system-ui;} .card{width:340px;background:{{$CARD}};border-radius:14px;padding:18px;box-shadow:0 8px 24px rgba(0,0,0,.06);} .skel{background:linear-gradient(90deg,{{$BASE}} 0%,{{$HL}} 50%,{{$BASE}} 100%);background-size:200% 100%;animation:sh {{$SPEED}}s ease-in-out infinite;border-radius:6px;} @keyframes sh { 0%{background-position:200% 0;} 100%{background-position:-200% 0;} } .row{display:flex;gap:12px;align-items:center;margin-bottom:14px;} .avatar{width:40px;height:40px;border-radius:50%;} .l1{height:14px;width:60%;margin-bottom:6px;} .l2{height:10px;width:40%;} .line{height:12px;margin-bottom:8px;} .line.last{width:80%;} </style> <div class="card"> <div class="row"><div class="skel avatar"></div><div style="flex:1;"><div class="skel l1"></div><div class="skel l2"></div></div></div> <div class="skel line"></div> <div class="skel line"></div> <div class="skel line last"></div> </div>`} knobs={[ { name: "BASE", label: "Skeleton base", type: "color", default: "#ecebe5" }, { name: "HL", label: "Skeleton highlight", type: "color", default: "#f5f4ee" }, { name: "CARD", label: "Card bg", type: "color", default: "#ffffff" }, { name: "BG", label: "Page bg", type: "color", default: "#f6f5f1" }, { name: "FG", label: "Text color", type: "color", default: "#0a0a0a" }, { name: "SPEED", label: "Animation (s)", default: "2.0" } ]} />

## When skeletons are wrong

Three contexts where a skeleton hurts more than a spinner or a delay:

- **Sub-300ms loads.** If the request comes back in 200ms, the skeleton flashes and the eye registers it as a glitch. Either use no loading state at all, or delay the skeleton's appearance by 200ms: start it at `opacity: 0` and fade it in with an animation that has `animation-delay: 200ms` and `animation-fill-mode: both`. Putting `animation-delay` on the shimmer alone only postpones the movement; the gray shapes still flash.
- **Forms.** A skeleton in place of a form field is confusing; the user expects to type. Show the field disabled with a spinner inside.
- **Single-source-of-truth content** like a price or a balance. A skeleton "$ — — —" is worse than "Loading..." because the eye reads it as "your balance is missing." For everything else — feeds, lists, cards, panels — skeletons beat spinners every time. ## Rendering skeleton states to MP4 For onboarding videos and marketing demos, render the skeleton-to-content transition: 2 seconds of skeleton, then a crossfade to the loaded content. The transition is 200ms. This is the single most-effective demo pattern for products that load data: it tells the viewer "we have a loading state we care about" without belaboring the point. [Open the playground](/playground), drop a skeleton card in, render the loaded-state transition. --- # MP4 vs WebM vs GIF in 2026 — the practical guide for product engineers URL: https://hyperframes.video/blog/mp4-vs-webm-vs-gif-2026 Published: 2026-04-14T09:00:00.000Z Tags: mp4, webm, gif, video, engineering, comparison Author: kira-tanaka The "which video format" question used to have a complicated answer involving fallbacks, codec capability detection, and a `<video>` element with three `<source>` children. In 2026 the answer is mostly "MP4," with two specific exceptions where you should reach for WebM or GIF instead. Here is the current state, written for product engineers shipping video on a website, in an app, or as a downloaded asset. ## TL;DR | Use case | Format | Why | |---|---|---| | Marketing site hero loop | **MP4 (H.264)** | Universal support, hardware decoded, autoplays muted | | Inline product demo in app | **MP4 (H.264)** | Same | | Transparent overlay video | **WebM (VP9)** | Only format with alpha channel that works in browsers | | Email-embeddable motion | **GIF** | The only format email clients render in-line | | User-downloadable export | **MP4 (H.264)** | Plays in QuickTime, VLC, every editor | | High-resolution archival | **MP4 (H.265)** or **AV1** | Better compression at the same quality | That's 80% of decisions. 
The rest of the post unpacks the edge cases.

## Where the formats stand in 2026

**MP4 / H.264**: the universal baseline. Every browser, every editing tool, every social platform. Hardware-decoded on every device made in the last decade. Lossy but tunable; modern encoders get to visually-lossless at ~6 Mbps for 1080p.

**MP4 / H.265 (HEVC)**: ~30% smaller than H.264 at equivalent quality. Now broadly supported in browsers but still has licensing complexity for batch production. Use for archival, skip for web embed.

**WebM / VP9**: Google's royalty-free codec. Supported in every major browser except for Safari edge cases. Compression is competitive with H.265. **The only alpha-capable format that Chrome and Firefox play.** Safari's alpha path is HEVC with alpha instead.

**WebM / AV1**: the newer royalty-free codec. Better compression than VP9. Hardware decode is now common in mid-range hardware. Use for high-volume video where bandwidth dominates cost.

**GIF**: 256 colors, no audio, no codec, file sizes that embarrass everyone. Still the only motion format that renders in-line in email clients. Use only when you have to.

## When you actually need WebM

One reason, mostly: **transparency**. If you need a video with an alpha channel (a character animation that drops onto an arbitrary background, a product shot with the background already cut out), WebM with VP9 and an alpha layer is the option that Chrome, Edge, and Firefox play.

The encoding command, roughly:

```bash
ffmpeg -i input-with-alpha.mov \
  -c:v libvpx-vp9 -auto-alt-ref 0 \
  -pix_fmt yuva420p \
  -b:v 2M \
  output.webm
```

Two flags matter: `-auto-alt-ref 0` (alpha videos break with alt-ref frames) and `-pix_fmt yuva420p` (the alpha-aware pixel format).

In HTML:

```html
<video autoplay muted loop playsinline>
  <source src="overlay.webm" type="video/webm">
</video>
```

No H.264 fallback: H.264 has no alpha channel. Safari has historically ignored VP9 alpha, so for Safari either add an HEVC-with-alpha source or render a separate MP4 composited onto the expected background.

## When you have to use GIF

One reason: **email**.
Every major email client (Gmail, Outlook, Apple Mail, every B2B mailer) renders animated GIFs in-line. None of them render video. Rules for email GIFs: - **Cap file size at 1 MB.** Outlook truncates animations past ~1.2 MB to the first frame. - **First frame is critical.** Many clients show only the first frame on initial open. Make it a complete, readable image. - **6 seconds is the practical ceiling.** Past that, file size goes up faster than engagement does. The dithering workflow that produces good GIFs from an MP4 source: ```bash ffmpeg -i input.mp4 -vf "fps=12,scale=600:-1:flags=lanczos,split[a][b];[a]palettegen[p];[b][p]paletteuse" out.gif ``` The `palettegen` + `paletteuse` pass gives you a per-clip optimized palette, which cuts file size by 40-60% compared to the default. Run it. ## Autoplay rules (the thing that bites everyone) Every browser blocks autoplay for videos with audio. The rule is: - **`muted` + `playsinline` + `autoplay`** = plays automatically. - **Audio present** = waits for user interaction. For hero-loop video on a landing page, always render silent and let the user unmute if they want. ```html <video autoplay muted loop playsinline> <source src="hero.mp4" type="video/mp4"> </video> ``` All four attributes matter. Drop `playsinline` and iOS will full-screen-takeover the video. 
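The autoplay rules above are easy to encode as a small lint check that runs before a page ships. The sketch below is illustrative TypeScript, not a browser API: the `VideoAttrs` shape and the `autoplayWarnings` name are hypothetical, and the function only encodes the rules stated in this section.

```typescript
// Hypothetical shape describing the boolean attributes on a <video> tag.
interface VideoAttrs {
  autoplay: boolean;
  muted: boolean;
  loop: boolean;
  playsinline: boolean;
}

// Returns a warning for each attribute combination this section says will
// misbehave. An empty array means the tag follows the rules.
function autoplayWarnings(attrs: VideoAttrs): string[] {
  const warnings: string[] = [];
  if (attrs.autoplay && !attrs.muted) {
    warnings.push("autoplay without muted: playback waits for user interaction");
  }
  if (attrs.autoplay && !attrs.playsinline) {
    warnings.push("autoplay without playsinline: iOS takes over the full screen");
  }
  if (attrs.autoplay && !attrs.loop) {
    warnings.push("autoplay without loop: the hero video stops after one pass");
  }
  return warnings;
}

// The hero-loop tag from this section passes cleanly.
const hero: VideoAttrs = { autoplay: true, muted: true, loop: true, playsinline: true };
const heroWarnings = autoplayWarnings(hero);
```

A check like this fits naturally in a component test or a build-time content lint, where a missing `playsinline` is cheaper to catch than on a phone.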
## File size targets For a typical landing-page hero loop (8 seconds, 1920×1080, no audio): <VariableKnobs html={`<style>body{margin:0;background:{{$BG}};color:#fff;font-family:ui-sans-serif,system-ui;display:grid;place-items:center;height:280px;} .tbl{width:520px;background:rgba(255,255,255,.05);border-radius:12px;overflow:hidden;border:1px solid rgba(255,255,255,.08);} .row{display:grid;grid-template-columns:1.4fr 1fr 1fr;padding:10px 14px;border-bottom:1px solid rgba(255,255,255,.06);align-items:center;} .row:last-child{border-bottom:0;} .row.h{background:rgba(255,255,255,.04);font:600 11px ui-monospace,monospace;letter-spacing:.18em;text-transform:uppercase;color:rgba(255,255,255,.55);} .row b{font-weight:700;} .row i{font-style:normal;font-variant-numeric:tabular-nums;} .win{color:{{$ACCENT}};font-weight:700;} </style> <div class="tbl"> <div class="row h"><span>Format</span><span>File size</span><span>Visual</span></div> <div class="row"><b>GIF 1080p 8s</b><i>{{$GIF}} MB</i><i>poor</i></div> <div class="row"><b>MP4 / H.264</b><i class="win">{{$H264}} MB</i><i class="win">good</i></div> <div class="row"><b>WebM / VP9</b><i>{{$VP9}} MB</i><i>good</i></div> <div class="row"><b>MP4 / H.265</b><i>{{$H265}} MB</i><i>great</i></div> <div class="row"><b>WebM / AV1</b><i>{{$AV1}} MB</i><i>great</i></div> </div>`} knobs={[ { name: "GIF", label: "GIF (MB)", default: "12.4" }, { name: "H264", label: "H.264 (MB)", default: "1.9" }, { name: "VP9", label: "VP9 (MB)", default: "1.4" }, { name: "H265", label: "H.265 (MB)", default: "1.2" }, { name: "AV1", label: "AV1 (MB)", default: "0.9" }, { name: "ACCENT", label: "Highlight", type: "color", default: "#ff3b1f" }, { name: "BG", label: "Background", type: "color", default: "#0a0a0a" } ]} /> The numbers are representative, not authoritative; bitrate tuning shifts each by ±20%. The shape is what matters: MP4 / H.264 is the right size-to-quality tradeoff for 95% of cases, and AV1 is the future for high-volume delivery. 
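To relate these file sizes to the bitrate figures quoted earlier, it helps to convert size to average bitrate. A minimal sketch, assuming 1 MB = 1,000,000 bytes and the 8-second clip above; the function names are illustrative and the sizes are the table's representative defaults, not measurements.

```typescript
// Average bitrate in megabits per second for a file of `sizeMB` megabytes
// (assuming 1 MB = 1,000,000 bytes) that lasts `seconds`.
function averageBitrateMbps(sizeMB: number, seconds: number): number {
  if (seconds <= 0) throw new RangeError("seconds must be positive");
  return (sizeMB * 8) / seconds;
}

// Fraction of bytes saved by `candidateMB` relative to `baselineMB`.
function savingsVs(baselineMB: number, candidateMB: number): number {
  if (baselineMB <= 0) throw new RangeError("baselineMB must be positive");
  return (baselineMB - candidateMB) / baselineMB;
}

// Representative defaults from the table above, for an 8-second clip.
const clipSeconds = 8;
const sizesMB = { gif: 12.4, h264: 1.9, vp9: 1.4, h265: 1.2, av1: 0.9 };

const h264Mbps = averageBitrateMbps(sizesMB.h264, clipSeconds); // 1.9 Mbps
const h265Savings = savingsVs(sizesMB.h264, sizesMB.h265);      // about 0.37
```

Because the clip is 8 seconds long, its size in MB happens to equal its average bitrate in Mbps. At these defaults H.265 comes out roughly 37% smaller than H.264, in the same range as the ~30% figure quoted earlier.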
## When MP4 H.265 is worth the friction Two cases: 1. **You ship a lot of video and CDN cost matters.** 30% smaller files compound across millions of plays. 2. **You ship to mobile and bandwidth-sensitive markets.** The compression delta is the user experience. For everything else, the licensing complexity of H.265 outweighs the file-size win. ## The deterministic render question [HyperFrames](/tools/html-to-video) defaults to MP4 / H.264 for output because it's the format the receiving editor / browser / platform actually wants. If you need WebM with alpha (for an overlay product) or batch AV1 (for CDN cost), both are flag-flips on the render command. The render pipeline is format-agnostic past the frame-capture step — the same frames composite to any of the five formats above. ## Practical recommendations - **Default to MP4 / H.264 unless you have a reason not to.** - **Reach for WebM only for alpha.** - **Reach for GIF only for email.** - **Reach for H.265 / AV1 only when CDN cost dominates.** Most of the time, the answer is one format. Pick it, render it, ship it. --- # FFmpeg vs HTML rendering — when each one is the right tool URL: https://hyperframes.video/blog/ffmpeg-vs-html-rendering Published: 2026-04-12T09:00:00.000Z Tags: ffmpeg, video, engineering, comparison, pipeline Author: kira-tanaka If you've ever needed to render video on a server, you have met FFmpeg. The "FFmpeg vs X" comparison framing is misleading though — FFmpeg is not a competitor to HTML-to-MP4 rendering. The two solve different problems and work best together. The interesting question is "which one for which step." This post is the engineering walk-through: what each tool is genuinely good at, where they overlap, where they don't, and how a real production pipeline composes them. ## What each tool actually is **FFmpeg** is a codec toolkit. It takes pixel data in (frames, raw video, source files) and produces encoded video out. 
It is excellent at: format conversion, codec encoding, audio/video sync, filter chains (blur, scale, color correction), and concatenating clips. It is not designed to *generate* content from scratch; it transforms content that already exists. **HTML-to-MP4 rendering** (the [HyperFrames pipeline](/tools/html-to-video), Remotion, similar approaches) is a *generator*. You write HTML/CSS/SVG; a headless browser rasterizes each frame; the frames are encoded into video. The browser is the renderer; FFmpeg (or another encoder) is the final mux step. In other words: **HTML-to-MP4 systems use FFmpeg under the hood for the encode step.** They are not alternatives; HTML-rendering is a layer on top. ## When HTML rendering wins The clear case: **anything generative, data-driven, or visually designed.** - **A pricing card with a customer's name in it.** HTML wins — the layout is text-and-CSS native. - **A chart from a JSON file.** HTML wins — SVG handles arbitrary data shapes. - **A 9:16 social ad with a typography animation.** HTML wins — kerning, line breaks, brand color tokens. - **An onboarding video customized per user.** HTML wins — the template logic is JSX. - **A 4-second loop for a marketing site.** HTML wins — same template as the site, no asset roundtrip. For every case where the content is generated from data or design code, an HTML pipeline is faster to iterate on, cheaper per variant, and produces deterministic output. ## When FFmpeg wins The clear case: **anything that operates on existing video content.** - **Trim, concat, splice.** Three clips into one — FFmpeg, one command. - **Color correction.** A LUT or contrast curve over an existing render. - **Codec conversion.** MP4 to WebM, H.264 to H.265, MOV to MP4. - **Audio mixing.** Voiceover + background music + render audio. - **Format compliance.** Re-mux for a specific platform's spec. - **Stabilization, deinterlacing, denoising.** Image-domain transforms on captured footage. 
FFmpeg is the right tool any time the source is already a video file.

## Where they overlap

Three areas, all worth knowing about:

1. **Text overlays.** FFmpeg's `drawtext` filter renders text on top of video. It works. It is also painful to use: font path quoting, escaping, no kerning control. For a single-line burned-in subtitle on a captured clip, `drawtext` is fine. For typography-driven graphics, use HTML.
2. **Watermarking.** Adding a logo PNG to the corner. FFmpeg's `overlay` filter is the standard tool. Use it for batch-watermarking existing files. If the watermark is part of the design (animated, positioned per-template), it belongs in the HTML layer.
3. **Concatenation.** Joining renders together. HTML pipelines can render multi-scene videos directly, but for assembling pre-existing assets, FFmpeg's `concat` demuxer is faster:

   ```bash
   ffmpeg -f concat -i list.txt -c copy out.mp4
   ```

   The `-c copy` flag is critical: it muxes without re-encoding, which is essentially free.

## A real pipeline composing both

A production video workflow we run regularly:

```
[Data]               ┐
[Template HTML/CSS]  ┼─→ Render frames
[Variant parameters] ┘   (headless browser)
                              ↓
                    [PNG frame sequence]
                              ↓
                     Encode with FFmpeg
                              ↓
                        [Silent MP4]
                              ↓
                  Mux with audio (FFmpeg)
                              ↓
                        [Final MP4]
                              ↓
              Per-platform re-encode (FFmpeg)
                              ↓
         [TikTok cut]  [Reels cut]  [YouTube cut]
```

Five FFmpeg invocations across the pipeline; one HTML render. They each do the part they are good at.
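The FFmpeg half of that diagram can be sketched as a list of argument arrays, one per invocation. This is an illustration rather than the production pipeline: the file names, the `PlatformCut` shape, and the target sizes are assumptions, and nothing is executed here. The flags themselves (`-framerate`, `-c:v libx264`, `-pix_fmt yuv420p`, `-c:a aac`, `-shortest`, and the `scale` and `pad` filters) are standard FFmpeg options.

```typescript
// One FFmpeg invocation, as the argument list you would hand to a process spawner.
type FfmpegArgs = string[];

// Hypothetical per-platform target; real specs should come from each platform's docs.
interface PlatformCut {
  name: string;
  width: number;
  height: number;
}

function encodeFrames(framePattern: string, fps: number, out: string): FfmpegArgs {
  // PNG sequence in, H.264 MP4 out; yuv420p keeps the file broadly playable.
  return ["-framerate", String(fps), "-i", framePattern,
          "-c:v", "libx264", "-pix_fmt", "yuv420p", out];
}

function muxAudio(video: string, audio: string, out: string): FfmpegArgs {
  // Copy the video stream untouched, encode audio to AAC, stop at the shorter input.
  return ["-i", video, "-i", audio, "-c:v", "copy", "-c:a", "aac", "-shortest", out];
}

function platformCut(input: string, cut: PlatformCut): FfmpegArgs {
  // Scale to fit inside the target frame, then pad to the exact size.
  const vf =
    `scale=${cut.width}:${cut.height}:force_original_aspect_ratio=decrease,` +
    `pad=${cut.width}:${cut.height}:(ow-iw)/2:(oh-ih)/2`;
  return ["-i", input, "-vf", vf, "-c:v", "libx264", "-c:a", "copy", `${cut.name}.mp4`];
}

function buildPlan(cuts: PlatformCut[]): FfmpegArgs[] {
  return [
    encodeFrames("frames/%05d.png", 30, "silent.mp4"),
    muxAudio("silent.mp4", "audio.wav", "final.mp4"),
    ...cuts.map((cut) => platformCut("final.mp4", cut)),
  ];
}

const plan = buildPlan([
  { name: "tiktok", width: 1080, height: 1920 },
  { name: "reels", width: 1080, height: 1920 },
  { name: "youtube", width: 1920, height: 1080 },
]);
```

With three platform cuts the plan holds five invocations: one encode, one audio mux, three re-encodes. Keeping the plan as data makes it easy to log, diff, or dry-run before anything touches the encoder.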
## Performance: what's fast, what's slow Rough numbers from our production pipelines (1080p, 30fps): <VariableKnobs html={`<style>body{margin:0;background:{{$BG}};color:#fff;font-family:ui-sans-serif,system-ui;display:grid;place-items:center;height:340px;} .tbl{width:540px;background:rgba(255,255,255,.04);border-radius:12px;border:1px solid rgba(255,255,255,.08);overflow:hidden;} .row{display:grid;grid-template-columns:1.6fr 1fr 1fr;padding:11px 14px;border-bottom:1px solid rgba(255,255,255,.06);font-size:13px;align-items:center;} .row.h{background:rgba(255,255,255,.04);font:600 11px ui-monospace,monospace;letter-spacing:.18em;text-transform:uppercase;color:rgba(255,255,255,.55);} .row:last-child{border-bottom:0;} .row b{font-weight:700;} .row i{font-style:normal;font-variant-numeric:tabular-nums;} .win{color:{{$ACCENT}};font-weight:700;} </style> <div class="tbl"> <div class="row h"><span>Operation</span><span>FFmpeg only</span><span>HTML + FFmpeg</span></div> <div class="row"><b>Trim + concat 3 clips</b><i class="win">{{$T1F}}s</i><i>n/a</i></div> <div class="row"><b>Render 1 variant from data</b><i>n/a</i><i class="win">{{$T2H}}s</i></div> <div class="row"><b>Watermark 100 clips</b><i class="win">{{$T3F}}s</i><i>{{$T3H}}s</i></div> <div class="row"><b>Render 100 personalized</b><i>n/a</i><i class="win">{{$T4H}}s</i></div> <div class="row"><b>Add audio track</b><i class="win">{{$T5F}}s</i><i>n/a</i></div> </div>`} knobs={[ { name: "T1F", label: "Trim+concat (s)", default: "2.1" }, { name: "T2H", label: "Render 1 (s)", default: "8.4" }, { name: "T3F", label: "Watermark 100 (s)", default: "90" }, { name: "T3H", label: "HTML watermark (s)", default: "240" }, { name: "T4H", label: "Batch 100 (s)", default: "210" }, { name: "T5F", label: "Audio mux (s)", default: "0.4" }, { name: "ACCENT", label: "Highlight", type: "color", default: "#ff3b1f" }, { name: "BG", label: "Background", type: "color", default: "#0a0a0a" } ]} /> The shape: anything that operates on existing 
pixels is fastest in FFmpeg (because it can stream and avoid re-encoding). Anything that generates new pixels is fastest in HTML (because it parallelizes across variants).

## What FFmpeg can't easily do

Things you'll regret trying to do in FFmpeg alone:

- **Text with brand typography.** No kerning control, no web font support, no auto-sizing.
- **A bar chart that animates.** FFmpeg has no concept of "data."
- **Per-row variants from a CSV.** FFmpeg has no templating layer, so every row becomes its own hand-scripted invocation.
- **A design system.** CSS gives you tokens; FFmpeg gives you flags.

Push these to HTML.

## What HTML rendering can't easily do

Things you'll regret trying to do in HTML alone:

- **Audio mixing or sync.** Browsers do not deterministically render audio. Generate the silent video, mux audio with FFmpeg.
- **Color grading captured footage.** Use a LUT in FFmpeg or a video-editing tool.
- **Concatenating pre-rendered files.** Two FFmpeg-emitted MP4s concat in ~2 seconds with `-c copy`. Re-rendering both through HTML would take minutes.
- **Live streaming.** Different pipeline entirely.

Push these to FFmpeg.

## The right mental model

**FFmpeg handles pixels you already have. HTML handles pixels you're about to create.**

Once you internalize that, the architectural question becomes mechanical: any time you're about to write a complicated FFmpeg filter chain to draw something, switch to HTML. Any time you're about to ask a headless browser to load an existing video clip, switch to FFmpeg.

The [HyperFrames pipeline](/tools/html-to-video) treats this as a default: the renderer emits frames, an embedded FFmpeg muxes them, and there's a CLI hook for any custom FFmpeg passes you want to run on the output. Both tools, in order, one command.

[Open the playground](/playground), generate the source, hand it to FFmpeg for the final mile.
--- # Animated pricing table videos — render once, A/B test forever URL: https://hyperframes.video/blog/animated-pricing-table-video Published: 2026-04-10T09:00:00.000Z Tags: pricing, saas, animation, video, tutorial, ab-test Author: marcus-okafor The pricing table is the highest-stakes piece of content on most SaaS sites. It's also one of the highest-ROI things to A/B test. The catch: every variant means a new design, a new ad, a new video — at a cost that usually makes teams skip the testing altogether. A code template solves both problems at once. Build the pricing table once in HTML, expose the variables (tier names, prices, features, accent color), render any number of variants as MP4 ads. Test the recommended-tier framing this week; test discount messaging next week; test enterprise positioning the week after. Same template, different data, ten ads a month. This is the engineering build, including the conversion-tested motion patterns that actually move the needle on pricing pages. ## The four-shot pricing-ad structure A 15-second pricing-ad reads as four shots: | Shot | Duration | Job | |---|---|---| | Frame | 0.0–1.0s | Headline ("Pricing that scales with your team") | | Reveal | 1.0–3.5s | Tier columns cascade in, recommended highlighted | | Compare | 3.5–10s | Feature rows reveal, checkmarks animate | | CTA | 10–15s | Single tier focus, sign-up button, accent glow | The whole thing rests on the reveal shot — that's where the eye locks onto the recommended tier. ## The tier-card geometry Three columns, equal width, the middle one ~12% taller. The height difference is what tells the eye "this is the recommended choice" before any color or label does. Add a 1px accent border and a subtle inner glow on the middle card. The conversion-tested layout, per card: 1. **Tier name** (uppercase, tracked, small). 2. **Price** (big, tabular-nums, accent color for recommended). 3. **Billing period** (smaller, muted). 4. **Feature list** (5–7 rows, checkmarks). 5. 
**CTA button** (full-width inside the card). Anything more — a "save 20%" pill, a "popular" banner — is optional and should only go on the recommended tier. <InlineSandbox html={`<!doctype html><html><head><style> @property --p { syntax: '<percentage>'; initial-value: 0%; inherits: false; } body{margin:0;background:#0a0a0a;color:#fff;font-family:ui-sans-serif,system-ui;display:grid;place-items:center;min-height:100vh;padding:24px;box-sizing:border-box;} .row{display:grid;grid-template-columns:repeat(3,1fr);gap:12px;width:560px;align-items:end;} .card{background:#141416;border:1px solid #1f1f1f;border-radius:14px;padding:18px;display:grid;gap:12px;animation:in .5s ease-out both;} .card:nth-child(1){animation-delay:0.05s;} .card:nth-child(2){animation-delay:0.15s;padding-top:30px;padding-bottom:30px;background:linear-gradient(180deg,#181018,#141416);border-color:#ff3b1f55;box-shadow:0 0 0 1px #ff3b1f33,0 12px 40px #ff3b1f22;position:relative;} .card:nth-child(3){animation-delay:0.25s;} @keyframes in{from{opacity:0;transform:translateY(12px);}to{opacity:1;transform:translateY(0);}} .tier{font:600 10px ui-monospace,monospace;letter-spacing:.25em;text-transform:uppercase;color:rgba(255,255,255,.55);} .price{font:700 36px ui-sans-serif,system-ui;font-variant-numeric:tabular-nums;letter-spacing:-.02em;line-height:1;} .card.r .price{color:#ff3b1f;font-size:42px;} .period{font-size:11px;color:rgba(255,255,255,.55);} .feat{display:grid;gap:6px;margin-top:6px;} .feat div{font-size:12.5px;display:flex;gap:8px;} .feat div::before{content:"✓";color:#ff3b1f;} .btn{padding:9px;border-radius:9px;background:rgba(255,255,255,.08);color:#fff;font:600 12px ui-monospace,monospace;letter-spacing:.18em;text-transform:uppercase;text-align:center;} .card.r .btn{background:#ff3b1f;} .pill{position:absolute;top:-10px;left:50%;transform:translateX(-50%);background:#ff3b1f;color:#fff;padding:3px 9px;border-radius:999px;font:700 9px 
ui-monospace,monospace;letter-spacing:.2em;text-transform:uppercase;} </style></head><body> <div class="row"> <div class="card"><div class="tier">Starter</div><div class="price">$0</div><div class="period">forever free</div><div class="feat"><div>100 renders / mo</div><div>1 seat</div><div>Community support</div></div><div class="btn">Start</div></div> <div class="card r"><div class="pill">Popular</div><div class="tier">Pro</div><div class="price">$29</div><div class="period">per user / mo</div><div class="feat"><div>5,000 renders / mo</div><div>5 seats</div><div>Priority support</div><div>API access</div></div><div class="btn">Start free trial</div></div> <div class="card"><div class="tier">Team</div><div class="price">$99</div><div class="period">per workspace</div><div class="feat"><div>Unlimited renders</div><div>20 seats</div><div>SSO + audit log</div></div><div class="btn">Contact sales</div></div> </div> </body></html>`} height={400} caption="Three tiers, cascade reveal, middle card raised and glowing." /> ## The "popular" plan signal Three signals on the recommended tier, all subtle: 1. **+10–15% taller** than its neighbors. 2. **1px accent border + soft outer glow.** 3. **"POPULAR" pill** above the card. Avoid: changing the background to a different color, doubling the tier name, adding a "BEST VALUE" yellow banner. The point is to nudge, not to club the viewer. ## Animation: cascade, then compare The reveal: - **Cards cascade in** from below, 80ms stagger. Total runtime under 400ms. - **Recommended card pulses once** at the end of the cascade (scale 1.0 → 1.02 → 1.0, 600ms). Single pulse, not a loop. - **Feature rows reveal** with a 40ms stagger after each card lands. - **Checkmarks draw** with a 200ms stroke trace, 50ms after each row appears. The whole sequence is over by 3.5 seconds. Anything longer and the ad feels slow. 
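The reveal timings above can be checked with a little arithmetic before any keyframes are written. A minimal sketch: the constants come from the list above, while the 220ms card entrance, the reading that a card's rows begin when that card lands, and all names are assumptions made for illustration.

```typescript
// Timings from the reveal description, in milliseconds.
const CARD_STAGGER = 80;
const ROW_STAGGER = 40;
const CHECK_DELAY = 50;  // checkmark starts this long after its row appears
const CHECK_DRAW = 200;  // stroke-trace duration
const PULSE = 600;       // single pulse on the recommended card

// Assumption: each card's entrance lasts 220ms, so three cards at an
// 80ms stagger land within 380ms, inside the 400ms budget.
const CARD_ENTER = 220;

interface RevealTimeline {
  cardStarts: number[];  // when each card begins its entrance
  cascadeEnd: number;    // when the last card has landed
  pulseEnd: number;      // when the recommended card finishes its pulse
  lastCheckEnd: number;  // when the final checkmark finishes drawing
  end: number;           // when everything above is done
}

function revealTimeline(cardCount: number, rowsPerCard: number, startAt = 0): RevealTimeline {
  if (cardCount < 1 || rowsPerCard < 1) {
    throw new RangeError("need at least one card and one row");
  }
  const cardStarts: number[] = [];
  for (let i = 0; i < cardCount; i++) {
    cardStarts.push(startAt + i * CARD_STAGGER);
  }
  const cascadeEnd = cardStarts[cardCount - 1] + CARD_ENTER;
  const pulseEnd = cascadeEnd + PULSE;
  // Rows begin when their card lands; the last row of the last card is the latest.
  const lastRowStart = cascadeEnd + (rowsPerCard - 1) * ROW_STAGGER;
  const lastCheckEnd = lastRowStart + CHECK_DELAY + CHECK_DRAW;
  return { cardStarts, cascadeEnd, pulseEnd, lastCheckEnd, end: Math.max(pulseEnd, lastCheckEnd) };
}

// Three cards, seven feature rows, reveal shot starting at the 1.0s mark.
const reveal = revealTimeline(3, 7, 1000);
```

Under these assumptions the cards start at 1,000, 1,080, and 1,160ms, and the sequence finishes at 1,980ms on the ad's clock, comfortably inside the 3.5-second limit. The same numbers can feed `animation-delay` values in the template.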
<VariableKnobs html={`<style>body{margin:0;background:{{$BG}};color:#fff;font-family:ui-sans-serif,system-ui;display:grid;place-items:center;height:400px;padding:24px;box-sizing:border-box;} .row{display:grid;grid-template-columns:repeat(3,1fr);gap:12px;width:520px;align-items:end;} .card{background:#141416;border:1px solid #1f1f1f;border-radius:14px;padding:18px;display:grid;gap:10px;} .card.r{padding-top:30px;padding-bottom:30px;background:linear-gradient(180deg,#181018,#141416);border-color:{{$ACCENT}}55;box-shadow:0 0 0 1px {{$ACCENT}}33,0 12px 40px {{$ACCENT}}22;position:relative;} .tier{font:600 10px ui-monospace,monospace;letter-spacing:.25em;text-transform:uppercase;color:rgba(255,255,255,.55);} .price{font:700 32px ui-sans-serif,system-ui;font-variant-numeric:tabular-nums;letter-spacing:-.02em;line-height:1;} .card.r .price{color:{{$ACCENT}};font-size:38px;} .period{font-size:11px;color:rgba(255,255,255,.55);} .btn{padding:9px;border-radius:9px;background:rgba(255,255,255,.08);color:#fff;font:600 11px ui-monospace,monospace;letter-spacing:.18em;text-transform:uppercase;text-align:center;} .card.r .btn{background:{{$ACCENT}};} .pill{position:absolute;top:-10px;left:50%;transform:translateX(-50%);background:{{$ACCENT}};color:#fff;padding:3px 9px;border-radius:999px;font:700 9px ui-monospace,monospace;letter-spacing:.2em;text-transform:uppercase;} </style> <div class="row"> <div class="card"><div class="tier">{{$T1}}</div><div class="price">{{$P1}}</div><div class="period">{{$D1}}</div><div class="btn">Start</div></div> <div class="card r"><div class="pill">Popular</div><div class="tier">{{$T2}}</div><div class="price">{{$P2}}</div><div class="period">{{$D2}}</div><div class="btn">Start trial</div></div> <div class="card"><div class="tier">{{$T3}}</div><div class="price">{{$P3}}</div><div class="period">{{$D3}}</div><div class="btn">Contact</div></div> </div>`} knobs={[ { name: "T1", label: "Tier 1", default: "Starter" }, { name: "P1", label: "Price 1", 
default: "$0" }, { name: "D1", label: "Desc 1", default: "forever free" }, { name: "T2", label: "Tier 2 (rec)", default: "Pro" }, { name: "P2", label: "Price 2", default: "$29" }, { name: "D2", label: "Desc 2", default: "per user / mo" }, { name: "T3", label: "Tier 3", default: "Team" }, { name: "P3", label: "Price 3", default: "$99" }, { name: "D3", label: "Desc 3", default: "per workspace" }, { name: "ACCENT", label: "Accent", type: "color", default: "#ff3b1f" }, { name: "BG", label: "Background", type: "color", default: "#0a0a0a" } ]} /> ## Variant testing in production The win of templating: each ad variant is a JSON file, not an After Effects project. Common things to test: - **Annual vs. monthly framing.** "$29/mo" vs. "$290/year saves $58." - **Tier-count.** Three tiers vs. four. Cap at four — five tiers consistently underperform. - **Recommended-tier position.** Middle vs. right. Middle wins ~70% of the time. - **Free tier wording.** "Free forever" vs. "Try free" vs. omit free tier. - **Color of the recommended tier.** Brand color vs. neutral. Brand wins for high-trust products, neutral for high-utility ones. Each test is one JSON edit and a re-render. The [batch render](/blog/batch-personalized-videos-from-csv) pipeline does the rest. ## Three aspect ratios from one template The same template renders to three sizes for ad platforms: - **1920×1080** — YouTube pre-roll, in-feed Facebook, LinkedIn. - **1080×1080** — Instagram in-feed, X. - **1080×1920** — Reels, Shorts, TikTok. Pricing tables don't fit naturally into 9:16 — solve by showing one tier at a time, with a horizontal swipe between them. The same data, different presentation. ## When the table isn't enough For high-ticket B2B sales ($1000+/mo plans), a pricing table video alone doesn't close. Pair the table render with: - **A specific value-add per tier** (rendered as a separate 6-second cut). - **A testimonial frame** ("Used by Linear, Vercel, Figma"). 
- **An ROI calculator embed** on the landing page itself. The video gets the click; the page does the rest. [Open the playground](/playground), drop your tiers in, render the ad cuts.