Skip to content
    AI engineering roles via the DiamantAI Collective.See open roles
    CreativeGenAI Agents

    TTS Poem Generator

    This tutorial guides you through the process of creating an advanced text-to-speech (TTS) agent using LangGraph and OpenAI's APIs. The agent can classify input text, process it based on its content type, and generate corresponding speech output.

    In the era of AI and natural language processing, there's a growing need for systems that can intelligently process and vocalize text. This project aims to create a versatile TTS agent that goes beyond simple text-to-speech conversion by understanding and adapting to different types of content.

    What you'll learn

    • 1
      Content Classification: Utilizes OpenAI's GPT models to categorize input text.
    • 2
      Content Processing: Applies specific processing based on the content type (general, poem, news, or joke).
    • 3
      Text-to-Speech Conversion: Leverages OpenAI's TTS API to generate audio from processed text.
    • 4
      LangGraph Workflow: Orchestrates the entire process using a state graph.

    About this tutorial

    This hands-on Jupyter notebook is part of GenAI Agents, a free open-source repository by Nir Diamant covering ai agents techniques with runnable code examples and detailed explanations.

    Free and open-sourceRunnable Jupyter notebookActive community support
    Go deeper · Amazon Bestseller in Generative AI

    RAG Made Simple

    Nir Diamant's complete visual guide to Retrieval-Augmented Generation — essential for any GenAI engineer building systems that retrieve and ground responses on real data.

    Get it on Amazon

    ⭐ 4.4 stars · 1,500+ readers · Kindle $9.99 · Paperback $24.99 · Free with Kindle Unlimited

    More Creative tutorials

    More from GenAI Agents