enTalkenator: a tool for thinking
I've made a new thing. I think it's amazing. It's the kind of project that would have seemed like magic when I was a child. As few as five years ago, I would never have dreamed I'd ever even use such a thing, let alone make it. The app is called enTalkenator, and while it feels to me like magic, it also kind of sucks. I can't wait for it to be better.
enTalkenator Is Magic
enTalkenator is a soon-to-be-released app (UPDATE: It's out!) that turns documents into clean transcriptions, translations, narrations, and entirely new conversations and articles. You give it a PDF or some text, and it can generate a faithful audio narration, an academic workshop, a podcast episode, a critical reply, a first draft, or whatever else you can dream up. I've always wanted a straightforward app that would turn academic PDFs into listenable audio — stripping out headers, footnotes, and the like — without fuss. A "podcast" app for articles. With LLMs this is now possible. Not only that, but you can drop in a Spanish text (even a handwritten one) and get an English or German text out.
Have a half-baked draft, an outline, or some notes? Drop it in (perhaps without any AI cleanup at all), and then select the option to generate a first draft, a workshop, a debate, or whatever else you want. Have a chapter of textbook and want to generate a summary, an explainer, or even a paper? It can do that. When I was showing a friend the app, I pasted in the Gettysburg Address, imported it in Icelandic, and then generated a 5,000 word structured critical response. Just today, I created a two and half hour narration of Larry Solum's selected download of the week from his legal theory blog — the original use case that inspired the app.
All these conversations and articles can be brought to life with good AI voices. And you can share what you've made — text and audio — with others, and, likewise, you can import what others have made. Once one of your colleagues, classmates, friends, or family has created things in enTalkenator, they can freely share them.
The generated conversations and articles are produced by templates in the app. It ships with several such templates, like the ones just mentioned. But you can edit them, make new ones, import others' templates, and share templates with others. You could make an outliner, an exam maker, an exam with answers maker, a heated argument, a peer reviewer, a doctoral defense, an "explain it to me like I'm five" generator, a fiction writer, whatever.
That's the magic part. And turning that magic into a working app was relatively easy. Turning it into a polished app (or as polished as I can make it at the moment, see below), well that's been a ton of work. But it's here. It works. I love it, and I think you will too, despite the downsides.
enTalkenator Kind of Sucks
So what's wrong with it? Well, first, setting it up is more fiddly than I'd like. This fiddliness is a classic barrier: deterring people from overcoming it, even though everything is easy to use once you've done so. Upshot is that to use enTalkenator with the various AI services, you need to enter your API key for those services.
You have to obtain these keys through webpages that are not directly connected to your existing subscriptions (if you have them). So even though you have a ChatGPT account, for example, you still need to go and set up your OpenAI API access, add some money to the account, and copy a key to paste into enTalkenator. It's actually super simple, but it doesn't sound like it to a new user. Yes, you only need to set up one of these, and, yes, once you set it up you don't have to think about it again until you want to add some more money to the account. But it'll turn many potential users off.
(An aside: I wish OpenAI, Google, and Anthropic (and DeepSeek) would call them App Keys — as they're the keys to being able to call the services from an app. As a developer, I think in terms of using their APIs. As a user, though, you're thinking of using them in the app. Even better would be a single sign-on api, so that I could just have the user sign-in to a service in my app. And even better than that, make App Access an option within the standard Anthropic and OpenAI subscriptions — all accessible from the same interface.)
I've spent a lot of time trying to reduce this friction, but it's still there. The alternative is to run my own server, handle a key or organizational key that the users securely retrieve and send, manage rate limits for the whole app, charge for in-app tokens that then cover the LLM bills, etc. etc. Upshot is that I'd be putting myself between the user and the LLM in a way that comes with all sorts of business and technical headaches and privacy implications. There are potential solutions on the horizon, but as of now I haven't found one that covers the globe, is secure and private, and allows me to deliver the app without ever collecting any data from the user at all. So API keys it is.
How much does it cost? Audio is provided only by OpenAI for now and is about 85 cents per hour. (If you don't opt to generate audio for a document, you can still listen to it using the iPhone's built in speech synthesizer. It doesn't sound great, but it's there. And you can opt to add AI audio at any time.) Generating text varies in price by model. Google's models can be used without charge (but you might have to wait longer, as the rate limits are lower for the free tier). Anthropic's Claude Sonnet 3.5 is still maybe the best text producer for many kinds of text. It costs about 40 cents to produce a transcript of a roughly 35 or 40 minute faculty workshop. It only costs about 11 cents to use the much-discussed DeepSeek R1 to do the same, and a few pennies to use DeepSeek's somewhat lesser model. It's free with the free tier of Google's Gemini.
When you ask enTalkenator to clean up an article for narration or ask it to generate a script, it gives you an estimate of the text and audio costs. If you're generating a workshop, say, you can turn the audio off and just generate a script. You can always hit the waveform button next to the workshop to generate audio later. And if you're loading in a PDF or copying in text from the web, you can turn both off and just get the text you selected or the text stripped from the PDF without AI processing. If what you want to do is generate workshops, oral arguments, or drafts from what you're adding (rather than listen to what you're adding in a clean, audiobook-like way) then don't clean it up at all, just generate. The AI that creates your conversations won't care if the text it's using is a bit messy.
The second issue that sometimes leads to enTalkenator suckitude is that relying on LLM services means relying both on network access and on the providers' abilities to maintain speedy uptime. DeepSeek, for example, has been down — or at least unreliable — for a few days as I write this. I cannot control that. And I cannot control the speed: sometimes the models are slow or you might hit rate limits that force the app to wait for a few seconds. Most of these services have "tiers" that give you higher rate limits the more you spend.
Nor can I precisely control whether the models generate amazing things or mere slop (more like enSlopenator...). Better models yield better results. And as models get better, faster, and cheaper, so will enTalkenator. My advice is to try all of them and see how it goes. I put a little money in several services and have had a blast testing them all.
enTalkenator Still Feels Like Magic
I still can't believe I'm able to use an app like this. It's like having a podcast app where conversations come from whatever you give it — no matter how ridiculous. I shared with my family the faculty workshop it generated based on this input: "On poo. By James Thompson. Poo poo and pee pee are funny. This is a serious academic work." While the workshop was more meta than I'd have liked, I could always have another go at it.
I've subtitled the app "a tool for thinking," because that's how I envision it and use it. Go deeper on anything by immersing yourself in the on-demand conversations and articles it makes about whatever interests you. Can it be superficial or wrong? Of course. But that's where you come in.
enTalkenator is free to use to generate transcripts and audio narrations of PDFs or text. (Obviously, you have to pay for the LLM services as described above, unless you're using Google's free tier.) It's also free to export and import enTalkenator files. So if someone else makes a bunch of conversations, you can import them and listen to and read them without subscribing to the app. It's $1.99 / month to use the conversation generation features.
Making enTalkenator an app rather than a web service is a bet that AI is going to get faster, cheaper, and better. The convenience and niceness of the app experience is now possible. I'm betting it's going to become ever more so.
I hope you try it out. It ships with sample content, and I'll probably have some .enTalkentor files you can download over on https://entalkenator.com. Happy generating!