05 Digital & AI — Youthwork Playground

05 Digital Tools & AI

The tools changed. The questions didn't.

Artificial intelligence has moved from a technology story to a practical reality in youth work faster than most people anticipated. Not because youth workers sought it out, but because it arrived inside the tools they already use — in email clients, in document editors, in translation apps, in the phones in participants' pockets.

The challenge for youth workers is not learning to use AI. Most already are, whether they realise it or not. The challenge is developing a conscious relationship with it — understanding what the tools actually do, which ones genuinely serve the work, where the risks and limitations lie, and how to use them in ways that align with the values of non-formal education rather than accidentally working against them.

This section maps the current landscape clearly. It is organised by what the tools do rather than by who made them — because the relevant question is always "what do I need to accomplish?" not "which company should I be loyal to?" It covers writing and content creation, image generation, video generation, voice tools, and — the most practically important for inclusive youth work — the specific ways AI can lower barriers for participants with fewer opportunities.

A word about pace. The AI landscape in 2026 moves fast enough that any specific tool comparison risks being outdated within months. What is more durable than the tool names are the categories, the use cases, and the evaluation questions. When a new tool appears — and new ones appear constantly — the questions to ask are consistent: Does it work across multiple languages? What does it cost, and who can afford it? What data does it collect, and from whom? Does it work on low-end devices and slow connections? What happens to participant data? These questions apply to every AI tool you will encounter, now and in the future.

A note on currency

This section reflects the AI tool landscape as of early 2026. Specific tools, model versions, and pricing change frequently. Use this section as a framework and a starting point — not as a definitive directory. Before adopting any tool in your practice, verify current pricing, privacy policies, and capabilities on the tool's own website.

05.2 Text & Writing

Writing, planning, and thinking out loud.

The most immediately useful AI tools for youth workers are the general-purpose writing assistants. They write, edit, translate, summarise, adapt tone, generate programme plans, and draft communications. Used well, they reduce the administrative burden of youth work significantly — freeing time for the parts that actually require a human.

Three platforms dominate this category in 2026, each with genuinely different strengths. The honest answer is that serious users do not pick one — they use different tools for different tasks. What follows is an accurate, opinionated guide to what each is actually best for, based on current performance rather than brand reputation.

Claude (Sonnet 4.6)

By Anthropic

Best for long-form writing, nuanced editing, and following complex instructions precisely.

What it is best for

Writing that needs to sound genuinely human — session plans, grant application narratives, participant communications, complex facilitation guides. Claude consistently outperforms competitors on following detailed instructions, maintaining a consistent voice across long documents, and producing writing that does not feel obviously AI-generated. Its 1 million token context window (in beta) means you can feed it an entire project's worth of documents and ask it to synthesise them.

What to know

Less strong than ChatGPT on image generation and has a smaller plugin ecosystem. The free tier has usage limits. Claude Sonnet 4.6 is the current everyday model — fast, capable, and available on the free tier. Claude Opus 4.6 is the most powerful model for complex reasoning tasks, available on paid plans.

Free tier: Yes — with daily usage limits Best for: long-form writing, grant narratives, complex instructions, voice consistency

ChatGPT (GPT-5)

By OpenAI

Best for image generation, voice conversation, and the broadest ecosystem of integrations.

What it is best for

Tasks that span multiple media — generating a text and then an image to accompany it, voice conversation practice, or accessing the wide plugin ecosystem. ChatGPT's image generation is currently the strongest in any all-in-one platform. Voice Mode has matured significantly and is genuinely useful for language practice and rehearsing facilitation scenarios out loud. Memory across conversations (on paid plans) means it builds context about your work over time.

What to know

GPT-5 is the current flagship model on paid plans. The free tier uses a capable but less powerful model. OpenAI has faced significant ethical controversy in early 2026 around military contracts, which has led some organisations to reconsider their use — worth being aware of if this matters to your organisation's values.

Free tier: Yes — GPT-5 limited on free tier Best for: image + text workflows, voice conversation, plugin ecosystem, language practice

Gemini (2.5 Pro)

By Google

Best for Google Workspace users, research tasks, and multimodal work involving video and audio analysis.

What it is best for

If your organisation lives in Google Workspace — Docs, Slides, Drive, Gmail — Gemini integrates natively and is the most frictionless option. Gemini 3 Flash is the current default free model; Gemini 2.5 Pro is Google's flagship thinking model for complex tasks. Deep Research is a standout feature — conducting multi-source research and synthesising findings far faster than manual research. Gemini is also currently the best at analysing video and audio files, making it useful for reviewing recorded sessions or processing multilingual audio content.

What to know

The free tier is the most generous of the three main platforms — offering capabilities that competitors charge for. If budget is a significant constraint, start here. NotebookLM is also a Google product and integrates with the same account.

Free tier: Yes — most generous of the three platforms Best for: Google Workspace, deep research, video & audio analysis, budget-conscious teams

Perplexity

Best for research with cited sources.

Not a writing tool but a research tool — and the best one available in 2026 for finding accurate, cited information fast. Where ChatGPT and Claude can hallucinate facts, Perplexity grounds every answer in live web sources with citations you can verify. For grant writing, programme research, or fact-checking statistics before putting them in a document, Perplexity is the most reliable starting point. Pro plan gives access to GPT-5, Claude Opus 4.6, and Gemini within one interface.

Free tier: Yes Best for: grant research, fact-checking, cited sources, programme background research

DeepSeek

Best performance-to-cost ratio, especially for API use.

A Chinese-developed model that consistently benchmarks alongside the top Western models at a fraction of the API cost. Relevant for youth organisations building AI into their own tools or processes on a limited budget. Strong on reasoning and structured tasks. Note: operates under Chinese data jurisdiction — organisations handling sensitive participant data should review privacy implications carefully before adopting.

Free tier: Yes Best for: API use, budget-conscious development, structured tasks, reasoning

05.3 Image Generation

Making visuals without a designer.

AI image generation has reached a level of quality in 2026 where the output of the best models is genuinely difficult to distinguish from professional photography or illustration. For youth workers creating programme materials, social media content, workshop visuals, and educational graphics, this is now a practical and accessible tool — not a specialist skill.

The image generation landscape splits into two types of tools: standalone image generators built specifically for image creation, and integrated generators built into the all-in-one AI platforms you already use. Both are worth knowing about. The integrated options are easier to start with — no new account, no new interface, just asking the writing tool you already use to make or edit an image. The standalone tools offer more control, more styles, and better results for specific use cases.

A practical note for youth work: AI-generated images carry ethical considerations that are more significant here than in commercial contexts. Images of people generated by AI can reproduce and amplify racial, gender, and cultural biases in ways that are particularly harmful when working with young people from diverse backgrounds. Be deliberate about what you ask these tools to generate, examine the outputs critically, and when in doubt use images from diverse stock photography libraries rather than generating people with AI.

Within tools you may already use

Integrated · ChatGPT

ChatGPT Image Generation

The most accessible starting point for integrated image generation. Ask ChatGPT to generate an image in the same conversation where you are writing text. The conversational refinement loop — "make it less formal", "add more diverse participants", "change the background to an outdoor setting" — is more intuitive than any standalone tool. Currently the strongest integrated option for following complex instructions precisely and for generating images that include readable text. Access via ChatGPT Plus ($20/month) or limited on free tier.

Best for: integrated text-and-image workflows, prompt-following, images with text

Integrated · Gemini

Gemini with Imagen 4

Google's Imagen 4 model, accessible through Gemini, is one of the strongest image generators for photorealistic scenes and complex multi-subject compositions. Particularly strong at spatial relationships and realistic textures. Available through Google AI Studio for free. Also integrated into Gemini's conversational interface. Gemini's image understanding — analysing and describing existing images — is currently the best of the three main platforms.

Best for: photorealism, complex scenes, Google ecosystem users, image analysis

Integrated · Gemini

Nano Banana

By Google

Google's image editing model, accessible through Gemini, and currently the best AI tool available for editing existing images rather than generating new ones from scratch. Where Imagen 4 generates from text prompts, Nano Banana takes an existing image and modifies it — precisely, intelligently, and with contextual understanding that earlier editing models lacked. Remove an object from a photograph. Change the background of a participant photo while keeping the people intact. Combine elements from multiple reference images into a coherent composite. Adjust lighting, colour, and composition on existing visuals. For youth workers who have real photographs from their programmes and want to adapt them for communications materials, this is more practically useful than any generation-only tool.

Best for: editing existing images, background removal, compositing, adapting programme photographs

Integrated · Adobe CC

Adobe Firefly

Within Creative Cloud

If your organisation already uses any Adobe Creative Cloud tools — Photoshop, Illustrator, Express — Firefly is built in. Its critical advantage is commercial safety: Adobe trained Firefly exclusively on licensed content, making it the only major AI image generator where you can use outputs in published materials without copyright concerns. The Generative Fill tool in Photoshop — extending, replacing, or modifying specific parts of an existing image — is among the most practically useful AI image features available alongside Nano Banana for editing workflows.

Best for: commercial safety, Adobe workflow integration, editing existing photographs

Dedicated platforms

Standalone

Midjourney v7

The undisputed benchmark for artistic and editorial image quality. Produces images with a distinctive aesthetic that feels directed and intentional rather than generated. Excellent for programme visual identity, mood boards, and any image where you want something that looks like it was made by a creative director. Now has a proper web interface — no longer requires Discord. Starts at $10/month. Note: images generated on the free tier are publicly visible on Midjourney's explore page — a consideration if you are working with sensitive material.

Best for: artistic quality, editorial aesthetics, visual identity work

Standalone

Ideogram 3

The strongest image generator for anything that includes readable text — posters, banners, certificates, workshop slide graphics, social media posts with headlines. Every other major generator struggles with text in images, producing distorted or misspelled words. Ideogram was built specifically to solve this problem and does it better than any competitor. Free tier gives 10 credits per week. From $8/month for full access. For youth work materials that combine visuals and text — which most programme graphics do — this is the most practical standalone tool.

Free tier: Yes — 10 credits/week Best for: posters, certificates, social content, any image with text

Standalone · Open Source

FLUX

By Black Forest Labs

The leading open-source image generation model and the best all-round option for photorealism as a standalone tool. Available through multiple third-party platforms and APIs, meaning you can access it without a dedicated subscription. For organisations with technical capacity, it can be self-hosted, meaning participant data never leaves your infrastructure. Legal for commercial use. Not the most accessible for non-technical users, but for organisations where data privacy and cost control matter, it is the most flexible option available.

Best for: photorealism, privacy-conscious organisations, API and technical use, self-hosting

05.4 Video

Moving images, without a camera crew.

AI video generation has crossed a threshold in 2026. The outputs of the leading models are cinematic, physically believable, and increasingly indistinguishable from professionally shot footage. For youth workers, the immediately relevant use cases are programme explainer videos, social media content, training materials, and accessible documentation of activities.

Video generation still requires more patience and prompt craft than image generation — getting consistent results across multiple clips, maintaining character identity, and directing camera movement all take practice. But the quality ceiling has risen dramatically, and even basic use of these tools produces results that were not achievable without professional equipment two years ago. Native audio generation — where the model generates synchronised sound effects and ambient audio alongside the video — is now standard in the leading tools.

For youth workers new to AI video, the most practical entry point is not generating scenes from scratch but animating existing images or creating short explainer clips for programme documentation. Most tools allow image-to-video generation — turning a still photograph or AI-generated image into a short clip with controlled camera movement. This approach gives you much more control over the output than text-to-video and produces consistently better results for non-specialist users.

Before you start

Video generation is significantly more expensive than image generation — in credits, in money, and in data. A single 10-second video clip consumes the equivalent of dozens of image generations in most platforms' credit systems. Free tiers are limited: most give you enough to test the tools, not enough to produce programme content at scale. Paid plans for quality video generation start at $12–$20 per month and credit consumption is fast.

There are also bandwidth and device considerations that matter directly in youth work contexts. Generated video files are large — typically 50–200MB per clip — which means downloading, sharing, and working with them requires reliable internet and reasonable device storage. For participants or co-facilitators accessing materials on mobile data plans, in rural areas, or on low-end devices, video-heavy content creates a real access barrier that image or text content would not. Be deliberate about when video genuinely serves the work better than a strong image or a well-written text — and be honest about whether the people you are working with can realistically access video content in practice.

Sora 2

By OpenAI

OpenAI · via ChatGPT

OpenAI's video model, accessible through ChatGPT subscriptions. The most narratively intelligent video generator available — excels at character emotion, dialogue-driven scenes, and coherent storytelling over longer clips. Up to 20 seconds per generation. The conversational ChatGPT interface makes it the most accessible entry point for youth workers already using ChatGPT. ChatGPT Plus ($20/month) gives limited access; ChatGPT Pro ($200/month) gives full access. The integrated approach means you can generate text, images, and video within the same tool and conversation.

Best for: narrative content, accessible starting point for ChatGPT users, dialogue-driven scenes

Veo 3.1

By Google

Google · via Gemini

Google's video model, accessible through Gemini Ultra and Google AI Ultra subscriptions. Currently the strongest for physical realism, accurate lighting, and precise prompt adherence — consistently outperforming competitors on benchmark tests measuring how accurately a model follows complex multi-element instructions. Native audio generation is integrated and high quality. For youth workers in the Google ecosystem, Veo 3.1 via Gemini is the most natural access point. Also available through Google AI Studio for developer and API use.

Best for: physical realism, precise prompt following, Google ecosystem users, native audio

Runway Gen-4.5

Standalone

The agency and filmmaker standard for AI video — not because it always produces the most photorealistic output, but because it gives the most control. Motion brushes allow you to specify exactly which part of an image should move and how. Character consistency across multiple shots is the strongest of any platform. From $12/month for Standard plan, with 4K export on paid plans and generations up to 40 seconds. The most powerful tool for youth workers who want to produce polished, multi-shot video content rather than single clips.

Best for: creative control, multi-shot content, character consistency, polished production

Kling 2.6

Standalone · Free Tier

The most cost-efficient professional video generator and the only platform currently capable of generating continuous video up to 2–3 minutes — compared to 20–40 seconds for all competitors. For youth work use cases — programme explainer videos, activity documentation, social media content — the longer format is genuinely useful. Strong lip-sync for dialogue-based content. Free tier available with watermarks. Chinese-developed — same data jurisdiction note as DeepSeek applies here.

Free tier: Yes — with watermarks Best for: longer video content, social media, cost-conscious organisations, free tier testing

05.5 Voice

Voices in, voices out.

Voice AI splits into two distinct directions — tools that convert text into spoken audio (Text to Speech), and tools that convert spoken audio into text (Speech to Text). Both have mature, accessible options in 2026 and both have concrete, practical applications for inclusive youth work.

Text to Speech

Text-to-speech has crossed the uncanny valley. The best TTS tools in 2026 produce audio that is genuinely difficult to identify as AI-generated, with natural emotional range, accent variety, and conversational rhythm. The practical applications for youth work are significant: creating audio versions of written materials for participants with low literacy or visual impairment, generating multilingual narration for programme content, producing accessible versions of evaluation forms, and creating audio resources for participants to use after the programme ends.

Standalone · Best Quality

ElevenLabs

The benchmark for voice realism in 2026.

The benchmark for voice realism and emotional range in 2026, consistently ranked first in independent quality tests. Covers 70+ languages. Voice cloning from a short audio sample. The inline emotional direction system — specifying whispers, emphasis, tonal shifts — gives granular control unavailable elsewhere. Voice design tool lets you create custom voices by describing characteristics (age, accent, gender, tone) rather than cloning a real person. Free tier: 10,000 characters per month. From $5/month for starter.

Free tier: Yes — 10,000 characters/month Best for: quality, emotional range, voice cloning, custom voice creation, audiobook-style narration

Standalone · Creator Focused

Murf

Built for content creators, not developers.

Built for content creators rather than developers — comes with a timeline editor that syncs audio to video, a Google Slides add-on for voiced presentations, and word-level control over pitch, emphasis, and pacing. 120+ voices across 20+ languages. Less emotional depth than ElevenLabs but a far more integrated production workflow. The Google Slides integration is especially practical for youth workers creating voiced programme presentations. Integrates directly with Canva and PowerPoint. From $29/month for creator plan.

Best for: programme presentations, voiced slide decks, integrated production workflows, team collaboration

Integrated · ChatGPT

ChatGPT Voice Mode

Lowest friction entry to voice AI for ChatGPT users.

For youth workers already using ChatGPT, voice mode is the lowest-friction entry to TTS — speak to it, hear it speak back, use it for conversation practice or language learning. The most natural voice flow and personality of any conversational TTS, making it genuinely useful for practising facilitation scripts, role-playing participant scenarios, or language exchange exercises. OpenAI's TTS API also powers many third-party apps at low cost. Available on ChatGPT free and Plus tiers.

Free tier: Yes Best for: conversational use, language practice, facilitation rehearsal, existing ChatGPT users

Speech to Text

Speech-to-text has equally practical applications: transcribing group discussions for documentation, generating accessible text records of verbal session content, supporting participants who express themselves better verbally than in writing, and enabling real-time captioning for participants with hearing impairments. The tools split into consumer-facing platforms and developer APIs — the distinction matters depending on whether you need a ready-to-use product or are integrating transcription into a larger tool.

Consumer · Meetings

Otter.ai

The most accessible starting point — no technical setup required.

The most accessible starting point for youth workers — no technical setup, integrates directly with Zoom, Google Meet, and Microsoft Teams, automatically joins calls and generates transcripts with speaker identification, automated summaries, and action item extraction. Ideal for transcribing planning meetings, partner organisation calls, and virtual sessions with participants. Free tier gives 600 minutes per month. From $16.99/month for unlimited on paid plans.

Free tier: Yes — 600 minutes/month Best for: meeting transcription, virtual sessions, team collaboration, no-setup required

API · Open Source

OpenAI Whisper

97 languages — broadest multilingual coverage of any tool.

The open-source transcription model that powers many consumer apps under the hood. Supports 97 languages — the broadest multilingual coverage of any transcription tool, making it uniquely valuable for the multilingual groups typical of Erasmus+ projects. Available free as a self-hosted model or via OpenAI's API at $0.006 per minute. Best in class for multilingual audio including code-switching — participants moving between languages mid-conversation. The right choice for technically capable organisations or for any workflow involving non-major languages.

Free tier: Yes (self-hosted) / $0.006/min API Best for: multilingual transcription, 97 languages, budget API use, open-source deployment

API · Developer

AssemblyAI

The most feature-rich transcription API.

The most feature-rich transcription API — goes beyond transcription to offer speaker identification, sentiment analysis, topic detection, content moderation, and automated summarisation within a single API call. Universal-2 model achieves approximately 8.4% word error rate, strong across diverse audio conditions. From $0.00249 per minute, cheaper than Whisper at API scale. The right choice for organisations building transcription into their own tools or needing more than raw text — for example, automatically identifying key themes from participant verbal feedback across multiple sessions.

Best for: developer integration, speaker diarisation, automated analysis, building custom tools

The most important question to ask about any AI tool in youth work is not "is this impressive?" It is "does this make the work more inclusive, or less?" Used deliberately, AI can lower barriers that have always existed in group learning environments. Used carelessly, it can add new ones.

05.6 Inclusion

AI as a barrier remover.

The digital divide is real and it matters for youth work. Participants from lower-income backgrounds, rural areas, or countries with limited digital infrastructure may have limited access to smartphones, data plans, or reliable internet — which means that AI tools which require constant connectivity or expensive devices are not universally accessible. Being conscious about which tools you require participants to use, and providing alternatives for those who cannot access them, is a basic inclusivity responsibility that the excitement around AI makes easy to overlook.

That said, the specific ways AI is changing accessibility in group learning contexts are genuinely significant. Real-time translation, adaptive text complexity, voice-to-text for non-writers, text-to-speech for non-readers — these are not marginal edge cases. They are directly relevant to the multilingual, mixed-ability groups that characterise inclusive Erasmus+ projects. What follows is not speculative — these are tools that exist, work reliably in 2026, and have concrete applications for the groups you work with.

Live translation for multilingual groups

In a group with participants from five countries speaking different first languages, following a session conducted in English is a significant cognitive burden for non-native speakers. They are processing content in a second language, formulating responses in a second language, and often missing nuance that native speakers catch effortlessly. Real-time AI translation changes this significantly.

Microsoft Translator and Google Translate both offer live session translation where participants can follow a live speech in their own language on their own device in real time. The facilitator speaks — participants read along or listen in their preferred language simultaneously. This does not replace language learning, but it removes the comprehension burden during sessions that are not about language. For participants with very limited English, it can mean the difference between genuine engagement and polite endurance.

Tools to use:

Microsoft Translator (live sessions) Google Translate (conversation mode) Wordly (events & conferences)

Adapting text complexity for different literacy levels

Written programme materials — session descriptions, evaluation forms, group agreements, handouts — are typically written at a literacy level that assumes secondary education and comfortable written English. For participants with lower literacy, limited educational backgrounds, or processing differences, this creates a barrier that is invisible to the facilitator and deeply felt by the participant.

Any of the major AI writing tools can take a piece of text and rewrite it at a specified reading level in under thirty seconds. "Rewrite this in plain language for someone with basic reading skills" or "simplify this to B1 English" produces a usable alternative in moments. The same tools can generate visual summaries, bullet-point versions, or simplified versions in multiple languages simultaneously. Done in advance of sessions, this removes a barrier that would otherwise require a specialist in accessible communication.

Tools to use:

Claude ChatGPT Gemini any writing tool with explicit adaptation prompts

Voice-to-text for participants who struggle with writing

Written reflection, evaluation forms, and feedback tasks consistently disadvantage participants who express themselves better verbally than in writing — which includes people with dyslexia, low literacy, or simply different cognitive strengths. Asking these participants to write their reflections produces thin, guarded outputs that do not represent their actual thinking. Asking them to speak and having AI transcribe and organise what they say produces something much richer.

On any modern smartphone, native voice dictation — powered by Whisper or similar models — can be activated in any text field. Participants can speak their reflection directly into a shared document, an evaluation form, or a messaging app. This requires no special software — just awareness of the feature and explicit permission and encouragement from the facilitator to use it.

Tools to use:

Native smartphone dictation (iOS/Android) Otter.ai OpenAI Voice Mode

Audio and visual materials for different learning styles

Standard programme materials are text-based, which suits visual learners comfortable with reading but disadvantages auditory learners, kinaesthetic learners, and participants with dyslexia. AI makes it practical to create multiple format versions of the same content without the time investment that would previously have made it impossible.

ElevenLabs or Murf can convert any written resource into a natural-sounding audio version in minutes. ChatGPT or Gemini can generate an illustrated visual summary of a text document. Nano Banana can adapt and edit existing photographs to better represent the group's diversity. Ideogram can create a visual map of a day's programme that participants can reference without reading. These are not accommodations for specific diagnosed needs — they are universal design improvements that benefit all participants and specifically support those for whom text is not the optimal format.

Tools to use:

ElevenLabs Murf (audio versions) ChatGPT / Gemini (visual summaries) Nano Banana (visual adaptation)

Culturally sensitive and contextually adapted content

Standard youth work materials — scenarios, examples, discussion prompts, case studies — are frequently built on cultural references and assumed common knowledge that do not travel well across cultures. A scenario set in a Western European urban context may be meaningless or alienating to a participant from rural Romania, Azerbaijan, or Turkey. Manually creating culturally adapted versions for every cultural context in your group is not feasible.

AI writing tools can adapt any scenario, example, or prompt to a specified cultural context in moments. "Rewrite this scenario for a rural context in Eastern Europe" or "adapt this discussion prompt to make it relevant to young people in Turkey" produces a usable adaptation that a facilitator can review and refine. This is not about stereotyping — it is about replacing generic Western defaults with specific, relevant contexts that make the material legible to more participants.

Tools to use:

Claude ChatGPT Gemini any writing tool with explicit cultural adaptation prompts

Real-time captioning for participants with hearing impairments

Live captioning — displaying real-time text of what is being spoken — has been the gold standard for deaf and hard-of-hearing inclusion for decades. It has historically required specialist human stenographers at significant cost. AI has made real-time captioning free and widely available on consumer devices, removing a barrier that previously made inclusive facilitation of participants with hearing impairments financially prohibitive for most youth organisations.

Google's Live Transcribe (Android), Apple's Live Captions (iOS and macOS), and Microsoft's Captions feature in Teams and PowerPoint all provide free, accurate real-time captioning directly on participants' devices. For video calls, most platforms now include AI captioning by default. For in-person sessions, a facilitator can project live captions on a shared screen. This is a zero-cost, immediate improvement that requires nothing more than knowing the feature exists and turning it on.

Tools to use:

Google Live Transcribe (Android) Apple Live Captions (iOS/macOS) Microsoft Live Captions (Teams/PowerPoint)

Workbook complete

You've reached the end of the Youthwork Playground facilitator workbook.