Fynanc Guide Characters — Voice & Avatar Options

Quick Summary — Recommended Picks

Character	Voice	Avatar
Mary Poppins	Voice-design: warm British nanny preset	custom-generated portrait → talking-photo avatar
Bob Ross	Voice-design: soft, low male “Soothing” preset	custom-generated portrait → talking-photo avatar
Ms. Frizzle	Voice-design: bright, quirky female “Enthusiastic” preset	Illustrated cartoon → animated talking character
Bill Nye	Voice-design: fast, energetic male “Explainer” preset	custom-generated portrait → talking-photo avatar
Athena	Voice-design: regal, measured female “Authoritative” preset	custom-generated portrait → talking-photo avatar

☃

Mary Poppins

Warm-firm British nanny • Onboarding guide

Canonical Voice Signature

Accent: Received Pronunciation (RP) British — crisp, proper, educated

Pitch: Mid-to-high soprano range; clear and bell-like

Pace: Measured and deliberate; never rushed, every word placed with precision

Energy: Warm but firm — affectionate underneath an exterior of brisk propriety

Cadence: Musical lilt with rising inflections on instructions; matter-of-fact on corrections

Signature phrases: “Spit-spot!” • “Practically perfect in every way” • “A spoonful of sugar helps the medicine go down”

Reference: Julie Andrews in Mary Poppins (1964) — classically trained soprano with West End/Broadway theater projection. Emily Blunt’s 2018 version is slightly lower-pitched and drier.

Voice Options

V1: Voice-Design — Custom British Nanny

Use an AI voice-design tool to create a custom voice with these parameters: Female, British RP accent, mid-high pitch, warm but proper tone. Settings: Stability 0.70 (consistent but not robotic), Similarity Boost 0.75, Style Exaggeration 0.30 (slight theatrical warmth), Speed 0.90 (just below normal for measured pacing).

+ Closest match to the character; tunable; no licensing concern with the voice itself

- Requires iteration to get the RP accent crisp enough

Effort: Low-Medium (1-2 hours of voice tuning)

V2: Stock Voice — “Grandmother” / “Storyteller” Preset

Several voice libraries offer warm, mature British female voices labeled “Grandmother,” “Storyteller,” or “Posh.” These have built-in warmth and a nurturing quality. Apply slight speed reduction (0.90) and increase stability (0.75) for the measured Mary Poppins cadence.

+ Ready to use; warm and maternal tone out of the box

- May sound more “gentle grandma” than “brisk nanny” — missing the crisp firmness

Effort: Low (30 min to test and adjust)

V3: Voice-Clone from Film Clip

Clone a voice profile from a short clip of Julie Andrews speaking (not singing) in the original film. Use a 30-60 second clip of dialogue, then fine-tune stability and speed.

+ Most authentic sound

- Potential IP/rights concern (cloning a real actress’s voice); quality depends on clip clarity

Effort: Medium (sourcing clean audio + clone training)

Avatar Options (Video, Not Still)

A1: Custom-Generated Portrait → Talking-Photo Avatar

Generate a high-quality character portrait using AI image generation: “A warm, elegant British nanny in a dark navy coat and hat with a small flower, holding an umbrella, half-body portrait, friendly but proper expression, soft studio lighting, cream background, photorealistic” — then feed this into a talking-photo/talking-head avatar tool to create a lip-synced video.

+ Full creative control over look; photorealistic result; consistent framing with headroom; no licensing issue with the image

- Photorealistic style may feel slightly uncanny for a fictional character; needs good portrait prompt

Effort: Medium (image gen + avatar render ~2-3 hours)

A2: Illustrated Cartoon → Animated Talking Character

Create a stylized cartoon/watercolor illustration of the character (Disney-inspired warm illustration style), then use an animation tool to add lip-sync and subtle head movement. Most “kids show” feel.

+ Most appropriate for a 12-year-old audience; avoids uncanny valley; charming and approachable

- Animated lip-sync tools for illustrations are less mature; may look less polished than photo-avatar

Effort: Medium-High (illustration + animation pipeline ~3-4 hours)

A3: Stock Avatar — Closest Match

Use a pre-built stock avatar from an AI video platform. Look for: mature female, professional/formal attire, warm expression. Closest stock matches tend to be “business professional” women — none will have the hat/umbrella iconography.

+ Fastest option; production-ready immediately

- Won’t look like Mary Poppins at all; loses character identity

Effort: Very Low (15 min)

Reference: “Spoonful of Sugar” scene Reference: “Spit-Spot” compilation

🎨

Bob Ross

Calm, soft-spoken guide • Mistakes-are-OK coach

Canonical Voice Signature

Accent: Gentle American (Florida/general Southern softness); no strong regionalism

Pitch: Low-to-mid baritone; soft and quiet, almost a whisper at times

Pace: Very slow and deliberate — long pauses between thoughts, never rushed

Energy: Ultra-calm; the “godfather of ASMR” — intentionally soothing after years in the military

Cadence: Gentle rising tone when introducing ideas (“Let’s put a happy little...”), soft falling tone on completions. Frequent reassuring murmurs.

Signature phrases: “Happy little trees” • “We don’t make mistakes, just happy accidents” • “There are no mistakes, only happy accidents” • “Let’s get crazy”

Reference: Bob Ross on The Joy of Painting (1983-1994) — 403 episodes of his signature whisper-calm delivery. His son Steve Ross confirmed he deliberately adopted this soft style as a contrast to his mentor Bill Alexander’s harsher manner.

Voice Options

V1: Voice-Design — Custom Soothing Male

Create a custom voice with: Male, American accent (soft/neutral), low-mid pitch, very calm energy. Settings: Stability 0.80 (very consistent, no surprises), Similarity Boost 0.65, Style Exaggeration 0.15 (minimal — understated is the whole point), Speed 0.75 (noticeably slower than normal for that trademark unhurried feel).

+ Tunable to get the whisper-calm quality; the slow speed setting is critical and achievable

- Hard to capture the genuine warmth without sounding sleepy or monotone

Effort: Low-Medium (1-2 hours)

V2: Stock Voice — “Soothing” / “Gentle” / “Mentor” Preset

Voice libraries offer “Soothing” and “Gentle” male voices designed for calming content. Apply speed 0.75 and stability 0.80. The “Mentor” preset adds a wise-teacher quality that fits Bob Ross’s encourager role.

+ Quick to deploy; soothing quality built in

- Generic calm voice ≠ Bob Ross’s specific whisper-warmth

Effort: Low (30 min)

V3: Voice-Clone from Show Clip

Clone from a clean segment of The Joy of Painting — use a 60-second monologue clip (plenty of high-quality audio exists on YouTube). His voice is extremely distinctive and clones well due to consistent delivery.

+ Most recognizable result; his voice is iconic and kids would instantly “get it”

- Rights concern (Bob Ross Inc. controls his likeness/voice commercially); fine for private use

Effort: Medium (source audio + clone ~1-2 hours)

Avatar Options (Video, Not Still)

A1: Custom-Generated Portrait → Talking-Photo Avatar

Generate portrait: “A friendly man with a large curly brown afro and a neat beard, wearing a blue button-down shirt, holding a paint palette, half-body portrait, warm gentle smile, soft studio lighting, nature-green background, photorealistic” — then create talking-head video with lip-sync.

+ Iconic look is easy to capture (afro + blue shirt + palette); photorealistic works well for a real person character; proper headroom framing

- Getting the exact warmth in the eyes/smile takes prompt iteration

Effort: Medium (2-3 hours)

A2: Illustrated Cartoon → Animated Talking Character

Warm, hand-painted illustration style (like a PBS kids show). His look is simple and iconic: curly afro, beard, blue shirt, palette. Animate with gentle head movement and lip-sync.

+ Very kid-friendly; his visual is so iconic even a cartoon version is instantly recognizable

- Cartoon lip-sync may not match the ASMR calm energy as well as photo-avatar

Effort: Medium-High (3-4 hours)

A3: Stock Avatar — Closest Match

Look for: casual male, beard, friendly expression, neutral/outdoor background. Stock avatars won’t have the afro or painter aesthetic.

+ Fast

- Won’t be recognizable as Bob Ross; misses the entire visual identity

Effort: Very Low (15 min)

Reference: Full episode “Island in the Wilderness” Reference: “Happy little trees” compilation

🌌

Ms. Frizzle

Whimsical, adventurous educator • Choose-your-adventure guide

Canonical Voice Signature

Accent: Standard American with theatrical flair; slightly nasal, expressive

Pitch: Mid-range female with wide variation — swoops high on excitement, drops low for dramatic emphasis

Pace: Energetic and varied — speeds up with excitement, pauses dramatically for effect

Energy: High and infectious; delighted by discovery; unflappable in chaos

Cadence: Rhythmic and theatrical with a sing-song quality on catchphrases; voice rises on questions like she’s genuinely curious

Signature phrases: “Take chances, make mistakes, get messy!” • “Seatbelts, everyone!” • “At my old school, we never...” • “Wahoo!”

Reference: Lily Tomlin voiced the original Magic School Bus (1994-1997) with her trademark deadpan-then-explosive delivery. Kate McKinnon voiced the reboot (2017-2021) with more continuous high energy. Lily Tomlin’s version is the classic — theatrical, slightly quirky, with perfect comic timing.

Voice Options

V1: Voice-Design — Custom Quirky Educator

Create custom voice with: Female, American accent, mid-pitch with wide range, bright/enthusiastic tone. Settings: Stability 0.45 (lower stability = more expressive variation, which is essential for Frizzle), Similarity Boost 0.70, Style Exaggeration 0.65 (high — she’s theatrical), Speed 1.10 (slightly fast, energetic).

+ Low stability + high style exaggeration captures her dramatic delivery swings; most customizable

- Low stability can sometimes produce inconsistent takes; may need multiple generations

Effort: Medium (2 hours of tuning for the right energy)

V2: Stock Voice — “Enthusiastic” / “Perky” / “Playful” Preset

Voice libraries have “Enthusiastic” and “Perky” female voices. Increase style exaggeration to 0.50+ and speed to 1.10. The “Playful” preset may add the right whimsy.

+ Good energy baseline; less tuning needed

- May sound generically cheerful rather than specifically Frizzle-quirky

Effort: Low (45 min)

V3: Voice-Clone from Show Clip

Clone from a clip of Lily Tomlin’s Frizzle — use the “Take chances, make mistakes, get messy!” compilation or a field trip intro scene. Her delivery is highly distinctive.

+ Instantly recognizable; the theatrical quirk is hard to recreate synthetically

- Scholastic/PBS rights concern for commercial use (fine for homeschool); animated show audio quality varies

Effort: Medium (1-2 hours)

Avatar Options (Video, Not Still)

A1: Illustrated Cartoon → Animated Talking Character

Ms. Frizzle is originally an animated character, so cartoon style is the most natural fit. Create a vibrant cartoon illustration: “A quirky female teacher with wild curly red hair, wearing a colorful dress with a fun science pattern (planets, molecules), big earrings, bright smile, half-body, cartoon/animation style, colorful classroom background” — then animate with lip-sync and expressive head/hand movements.

+ Most authentic to the character’s origin; most “kids show” feel; avoids uncanny valley entirely; a 12-year-old would find this most natural

- Animated character tools are still maturing; may need a specialized animation pipeline

Effort: Medium-High (3-4 hours)

A2: Custom-Generated Portrait → Talking-Photo Avatar

Generate a photorealistic portrait of a woman matching Frizzle’s description (red curly hair, colorful patterned dress, fun earrings, bright expression), then create talking-head video.

+ Higher production quality for lip-sync; consistent with the other characters if going photo-avatar for all

- Photorealistic Frizzle feels wrong — she’s inherently a cartoon character; may feel uncanny

Effort: Medium (2-3 hours)

A3: Stock Avatar — Closest Match

Look for: energetic female, colorful attire, teacher setting. No stock avatar will capture the wild red hair + science dress combo.

+ Fast

- Completely loses the Frizzle identity

Effort: Very Low (15 min)

Reference: “Take chances, make mistakes!” compilation Reference: Magic School Bus intro

🔬

Bill Nye

Fast, energetic science educator • Formula & calculator guide

Canonical Voice Signature

Accent: Standard American (Pacific Northwest origin); clear, broadcast-quality diction

Pitch: Mid-range male; uses pitch jumps dramatically — rises with excitement, drops for punchlines

Pace: Fast and punchy — MTV-paced delivery with quick cuts between ideas; the show was deliberately styled as rapid-fire

Energy: Very high; genuine excitement about science; slightly nerdy enthusiasm that’s infectious rather than overbearing

Cadence: Staccato bursts of explanation punctuated by “consider the following” pauses; uses vocal emphasis on key science terms; occasional quirky humor

Signature phrases: “BILL! BILL! BILL! BILL!” (theme chant) • “Science rules!” • “Consider the following” • “Now you know!”

Reference: Bill Nye on Bill Nye the Science Guy (1993-1998) — 100 episodes. Known for “high-energy presentation and MTV-paced segments.” Research confirmed regular viewers were better at explaining science than non-viewers, validating his energetic approach.

Voice Options

V1: Voice-Design — Custom Energetic Explainer

Create custom voice with: Male, American accent (clear/broadcast), mid-pitch, enthusiastic energy. Settings: Stability 0.50 (allows expressive variation for his dramatic pitch jumps), Similarity Boost 0.70, Style Exaggeration 0.55 (his energy is theatrical but not over-the-top), Speed 1.15 (noticeably fast, matching his rapid-fire delivery).

+ Best balance of energy and clarity; speed setting captures the rapid-fire feel; tunable pitch variation

- Needs careful tuning to avoid sounding rushed vs. intentionally fast-and-clear

Effort: Low-Medium (1-2 hours)

V2: Stock Voice — “Explainer” / “Motivational” / “Fast-Paced” Preset

Voice libraries offer “Explainer Voiceover” (designed for educational content) and “Fast-Paced” male voices. The “Motivational Speaker” preset adds the right punch. Increase speed to 1.15 and style to 0.45.

+ “Explainer Voiceover” presets are literally built for this use case; good diction built in

- May sound like a podcast narrator rather than a science showman

Effort: Low (30-45 min)

V3: Voice-Clone from Show Clip

Clone from a “Consider the Following” segment or intro monologue. Bill Nye is a real public figure with abundant clean audio. His voice is distinctive but not as unique as Bob Ross’s — it’s the energy more than the timbre.

+ Authentic sound; abundant source material

- He’s a living public figure; voice cloning has higher sensitivity. Private use should be fine.

Effort: Medium (1-2 hours)

Avatar Options (Video, Not Still)

A1: Custom-Generated Portrait → Talking-Photo Avatar

Generate portrait: “A friendly man in a blue lab coat and bow tie, safety goggles on his forehead, enthusiastic expression with a slight grin, half-body portrait, colorful science lab background with beakers and test tubes, photorealistic, well-lit” — then create talking-head video with animated lip-sync. The bow tie is the key visual signature.

+ Lab coat + bow tie = instant science educator identity; photorealistic matches a real-person character; great headroom framing

- May look too generically “scientist” without specific resemblance

Effort: Medium (2-3 hours)

A2: Illustrated Cartoon → Animated Talking Character

Cartoon-style illustration: energetic man in lab coat and bow tie, animated with dynamic head movements and hand gestures to match his high energy.

+ Fun, kid-friendly; can exaggerate expressions to match the energy

- Bill Nye is a real person, so cartoon may feel disconnected; less impactful than photo-avatar for this character

Effort: Medium-High (3-4 hours)

A3: Stock Avatar — Closest Match

Look for: male, professional/educator type, animated expressions. Some platforms have “professor” or “educator” stock avatars with a science lab backdrop.

+ Fastest option

- Generic educator; no bow tie, no science-guy identity

Effort: Very Low (15 min)

Reference: “Consider the Following” compilation Reference: Best of Bill Nye

🏆

Athena

Wise, measured, classical • Reflection & mastery guide

Canonical Voice Signature

Accent: No single canonical version; best portrayals use Transatlantic/mid-Atlantic or a refined neutral accent that sounds “timeless”

Pitch: Alto-to-mezzo range; deep enough for authority, clear enough for wisdom

Pace: Measured and deliberate — every word carries weight; comfortable with silence

Energy: Calm authority — not cold, but regal; warmth comes through in moments of encouragement

Cadence: Even, rhythmic, almost incantatory; rises slightly on questions that invite reflection; declarative statements land firmly

Teaching vibe: “Wisdom is not knowing everything — it is knowing what matters” • “Consider what you have learned” • “The owl sees what others miss”

Reference portrayals: Shohreh Aghdashloo (Expanse) — gravelly, authoritative female; Cate Blanchett (Galadriel) — ethereal, measured; Carrie-Anne Moss (Athena in Telltale Games) — calm and commanding. For a kids’ lesson, aim for “wise teacher with gravitas” rather than “intimidating goddess.”

Voice Options

V1: Voice-Design — Custom Regal Sage

Create custom voice with: Female, neutral/transatlantic accent (timeless, not tied to a specific region), alto-mezzo pitch, calm authority. Settings: Stability 0.75 (very composed and even), Similarity Boost 0.70, Style Exaggeration 0.25 (restrained — Athena doesn’t need theatrics), Speed 0.85 (slower than normal for gravitas and reflection).

+ Most control over the “timeless wisdom” quality; high stability avoids the voice sounding flighty

- “Regal but warm” is a delicate balance; too stable can sound robotic

Effort: Low-Medium (1-2 hours)

V2: Stock Voice — “Authoritative” / “Voice of God” / “Dramatic” Preset

Voice libraries offer “Authoritative” female voices and “Voice of God” (epic narration) presets. The “Dramatic” preset adds a mythological gravitas. Reduce speed to 0.85 and increase stability to 0.75.

+ “Authoritative” presets have the right base tone; “Dramatic” adds the mythic quality

- May sound like a movie trailer narrator rather than a wise teacher

Effort: Low (30-45 min)

V3: Voice-Clone from Film/Game Clip

Clone from a portrayal of a wise female figure (Galadriel, Athena from God of War, etc.). This gives the mythic quality naturally. Use a 30-60 second monologue.

+ Captures the mythic vocal quality that’s hard to synthesize from scratch

- Rights concern with cloning an actress; mythological character has no single “canon” voice

Effort: Medium (1-2 hours)

Avatar Options (Video, Not Still)

A1: Custom-Generated Portrait → Talking-Photo Avatar

Generate portrait: “A wise, regal woman with olive skin, dark wavy hair adorned with a golden laurel crown, wearing a flowing white and gold draped garment, a small owl perched on her shoulder, half-body portrait, serene and knowing expression, warm golden lighting, marble column background, photorealistic, classical Greek aesthetic” — then create talking-head video.

+ Photorealistic goddess portrait can be stunning; golden/white palette matches her brand color on the lesson page; owl adds instant recognition

- “Greek goddess” photorealism can veer into cosplay territory if not done well

Effort: Medium (2-3 hours)

A2: Illustrated Cartoon → Animated Talking Character

Classical illustration style (think Disney’s Hercules Athena or a storybook goddess): warm, approachable, golden-toned. Animate with subtle, dignified movements — minimal head movement, calm lip-sync.

+ Most approachable for a 12-year-old; avoids the cosplay risk; the dignified animation matches her measured energy

- Illustration quality needs to convey gravitas, not just “cartoon goddess”

Effort: Medium-High (3-4 hours)

A3: Stock Avatar — Closest Match

Look for: regal/professional female, dark hair, warm complexion. No stock avatar will have the classical Greek styling, laurel crown, or owl.

+ Fast

- No mythological identity; just looks like a professional woman

Effort: Very Low (15 min)

Reference: Athena in Percy Jackson (film) Reference: Athena in God of War

Production Notes

Consistency recommendation: For a unified look across all 5 guides, pick ONE avatar approach for all characters (all custom-generated portraits, or all illustrated cartoons). Mixing styles may feel disjointed. Exception: Ms. Frizzle works best as an illustration even if others are photo-avatars — she’s inherently a cartoon character.

Voice consistency: All voices should be generated through the same tool/platform so audio quality, format, and “feel” match. Use the same output settings (sample rate, format) for all 5.

Video framing (standing rule): All avatars use half-body framing with headroom — never crop the head. Character should be centered, visible from mid-torso up.

Total estimated effort (recommended picks): ~12-16 hours for all 5 characters (voice design + portrait generation + avatar rendering + QA).

Next step: Pick your preferred voice option (V1/V2/V3) and avatar option (A1/A2/A3) for each character. Then we render them.