"Don't You Just Upload It to ChatGPT?"

The ending is a really powerful point. Most people apparently agree on two things:

1. AI is a great boon for all tasks and specialties we don’t have the skills to do ourselves. Understandable, since (A) we’re ill equipped to see the flaws in its output because it isn’t our area of expertise, and (B) it often can unlock great gains because if we trust it, we then don’t have to pay and wait for humans to do that thing.

2. AI is a terrible replacement for me - my skills are at such a high level that it’s almost theoretical that it’ll ever be good enough to replace me for 90% of what I get paid to do. It’s a tool at best.

This is why I use AI for all my medical questions and doctors use AI to write software, and we both smirk at the quality the other person is getting from it.

Slight tangent into translations:

I read two translations of the book "The Master and Margarita". My first read was so boring I couldn't help but stop reading before the end of the first chapter. I can't find the copy and the name of the person who translated it, but this one had all the Russian nicknames translated. It kept talking about a guy called homeless. I thought it was just a bad book and dismissed it for years. I couldn't understand what all the fuss was about with this book.

But then, I stumbled upon the translation by Diana Burgin and Katherine Tiernan O'Connor. Although I don't speak Russian, I think this is as good as it gets. They did a phenomenal job.

You can see the same effect with the mechanical translation of the book "We" by Yevgeny Zamyatin, where the government is called "United State" easily confused with the "United States". The translation that called it "One State" was so much better.

An honest to god article full of em dashes that's not because it was AI but because it was a human using them as a crutch to get around crafting sentences that flow naturally. Almost brings a tear to my eye.

You'd be laughed at if you said that ChatGPT could help you with graduate level mathematics in 2024, but this year, AI models on simple prompts are solving previously unsolved Erdos problems.

It seems silly to imagine that there is some fundamental barrier between human intelligence and AI, and that AI could never do many of the things that humans can do. Inferring intent, gauging sentiments, factoring in cultural values, etc. all the things cited as stuff humans can do but AI can't, AI can currently do if given enough context. But more importantly, all those things aren't magical tasks that can only occur inside a human skull, they are a product of information processing, its just the information processing that has been hard to make computers good at, but so far it appears AI keeps getting better.

I'm all for humans having special value that is not attached to their ability to perform useful work. However denying the abilities of AI models seems to be a common mistake many people are making, and sadly reality catches up to these people before they can emotionally prepare.

I have no doubt that the writer is better at translating than AI, but I have to say that AI translation has gotten so good that I'm not sure how much longer translation work will be there, or rather it might end up being more about auditing.

For example, I just read the Lawrence Ellsworth translation of The Three Musketeers, which I very thoroughly enjoyed. I don't speak or read French, but from my understanding Ellsworth's translation is considered one of the more accurate translations of the work.

Out of curiosity, I sic'd Claude Fable on the original French version of The Three Musketeers and told it to translate accurately, but also try and keep the same jovial tone as the original and do not censor anything. After it was done, I didn't read the entire output, but I did compare a few individual chapters between the Ellsworth translation and the Fable translation.

They were honestly remarkably similar. As far as I could tell, nothing was substantially different from the Ellsworth translation and the Fable translation. I do think that the prose for the Ellsworth translation was a bit better, but the prose for the Fable one was actually perfectly readable. Again, I don't speak French so I cannot say for sure, but I do not believe that I would have gotten a significantly different experience had I read the Fable version instead of the Ellsworth version.

Now, it's possible (and likely) that this is somewhat self-fulfilling; Fable might have been trained using Ellsworth's translation and as such it's very directly able to crib from it; sadly since I do not speak any language outside of English, there's sort of a catch-22: the only way I can compare the accuracy of a translation is to compare against other translations, but if other translations exist then that will likely influence the results, and if a translation doesn't already exist then I have no way of auditing it.

I'm still going to continue reading through Ellsworth's translations for the subsequent stories simply because that feels more canonical, and as I said I do think the prose was a bit better.

I think it's an interesting perspective, because translation is one of the jobs that I (a) hear is the first to lose work due to AI, and (b) often used as an example of "acceptable" AI by people who are skeptics of LLMs and AI-generated art.

Out of curiosity, I pasted an article in French I was reading a few minutes before coming across this thread into ChatGPT and asked for a translation into English. It was certainly passable from a functional perspective, and I wouldn't hesitate to use it to translate an article from a language I don't understand. But it was not professional-quality work. There were a couple instances where the French grammar was mistranslated, and the writing was perfunctory, not going into any effort to have the article flow like it was originally written in English instead of simply translating each sentence literally. Would I read an article written like this? A short one. A novel? Definitely not.

"we all more or less look the same in gym clothes"

Maybe my brain works differently than the author, but I'm surprised at this statement. Gym clothes don't change recognition for me, it's about the face, body, posture, clothes don't really enter into it. For me it is nonsensical enough to be suspicious.

And for a human centric perspective, not recognizing who someone is sad, it's knowing that you probably won't meet them again so it's not worth it, the community isn't there. Where community and interpersonal relationships between people are something we still hold dearly.

I don't see LLMs being able to replace translators for less-spoken languages.

I know a translator between two Eastern European languages, and some jobs require use of specialized dictionaries. Using LLMs in such cases would be very unreliable and would require even more effort to check and correct than doing it correctly in the first place. Plus, I really doubt that US tech firms are training LLMs on language spoken by "only" 6 million people.

As for entertainment, anyone who grew up in Eastern Europe with pirated movies with nasal monotone translations, or machine-translated video games knows how much those take away from the experience. Sure, "AI could do better", but could it be consistent and capture cultural nuances and idioms, etc?

I say it’s a simple value proposition.

A few examples

Audio book narration. Human narrators are paid a seemingly ridiculous amount of money to literally read a book out loud. We have the tech to replace them, it’s actually pretty dang good, and it is substantially cheaper to do with computers. It’s pretty accurate too. In the audio book industry though, if you take your book seriously you have a real person read it. The best one you can find that you like. Readers enjoy hearing good narrators and the total value one narrator can bring is very high mostly because the value scales well.

Another real world example that doesn’t scale well, call centers. Customers want humans, but execs have tried to replace them with automation in every way possible. The margins of a business get squeezed because the value of the human touch doesn’t scale well in this case.

Translation falls a bit in the middle. I’m sure ChatGPT is good enough for some people. If you are a restaurant and need to understand what you are ordering at the local authentic Italian restaurant it’ll do the job. If you have a bad food allergy? Maybe not, you are willing to pay for accuracy because that’s what a human brings

So the answer to the question posed in the article, can’t you just upload it to ChatGPT? Maybe yea maybe no

What’s unfortunate is that the market that is willing to pay for high-quality human translation has shrunken considerably.

The most important thing a human translator does is certify that the translation is faithful.

Period.

You could do a machine translation if you want, but you better pore over every word in case you end up on the witness stand.

One of my parents tried this to beat a deadline for product packaging.

There are now bags being sold marked "Lawn Suits", when it was supposed to be Lawn Topdressing

I had transliterated lyrics of a song * with stanzas in Urdu , Braj Basha, Persian and Arabic , that I wanted to understand better ..

Gemini did a pretty good job of translating this to English .

Sure a professional human translator would have done a more nuanced job if I was willing to invest the money and time . But ...

* tajdar e haram originally by Payam Saihalwi, later versions by the Sabri Brothers and recently by Asif Aslam

I recently saw a video showing the french to german translation of a french McDonald's terminal. The translations were hilarious bad, like old school google translate bad.

Maybe McDonalds is big enough to not care about their reputation, maybe they are happy about the free clout from people making fun of them but they certainly chose to cheap out on translations.

https://www.tiktok.com/@denneshow/video/7522160205501566230

As a public service employee within the GOC, I feel the pain expressed by the author. I sat through a meeting today where somebody with no domain knowledge puffed up their chest to show off their gpt created master lesson plan for a four year long internal training plan that is being re-worked.

I could feel the heads of those around the table that had been teaching this material for a decade starting to explode as this was exactly what others in the thread have described: it looked good until vetted by experts, then it was easy to poke holes as it was just not right

The problem in the public service is that the experts who can review the output are leaving or being nudged out.

I worked at large Japanese bank in New York and happened to sit near Chief US Economist next to his Japanese translator. She would occasionally ask about certain idioms. I remember explaining what a wildcat strike was for instance. But it must have been pretty tough because the guy was prolific in his commentary.

Sounds a aweful lot like the kind of things we were all saying before realising that we had to change what our jobs meant.

So i assume this post is just a bit of writing out frustration, but i'm always hoping that "AI can't do it" posts to include examples.

A list of "Examples AI will silently fail at" would be a lot more interesting, and might just convince your next potential client to _not_ use AI.

I'm gonna sound a bit like the clueless gym hr lady: I assume most income generating translation jobs are either mandated by law or commercially high stakes enough to warrant a human to do, no? Were people really being paid to do the type off _low stakes_ translations implied that a automated system can replace?

Maybe a publisher will replace the translator of the next Dan Brown best seller with Mythos? Who cares other than those buying it, getting money out of it?

> If you ask me, nothing can save downtown Ottawa or North American public transit.

Come to Montreal. Only 2H away and you can get by decently well without a car.

I can’t believe this article hasn’t been written by ChatGPT. The author claims to have written it but has clearly become completely captured by the awful generic style of AI writing.

unfortunately this person will soon be unemployed.

not because their skills are no longer relevant, but because they are taking a principled stance defending now irrelevant skills.

Presumably the people paying the author for translation services are aware of AI, but for whatever reason are choosing a humans services instead. IMO it would be a form fraud to heavily rely on AI and not disclose that to the customer.

> Should you pay your roofer less because he uses a hammer instead of his bare hands?

Yes. Effective tools increase the supply of roofs made. More supply means lower prices per roof. But because the same number of roofs need to get worked on, the increase in roofs per roofer means less roofers will be needed.

    AI isn’t replacing me. Like a toddler, it
    needs to be constantly coached.

Like a toddler, it will grow up.

Humans are really bad at noticing trajectories. They see the current situation. They know what the situation was 5 years ago. But for some reason they do not believe that there is a trajectory. They view the present state as the final destination.

It's quite ironic as the transformer architecture that powers most generative AI was invented for language translation :)

Safe to say OP just does NOT like AI https://correresmidestino.com/sorry-i-was-busy-unfucking-my-...

Poor woman should really look into pivoting her career or finding a different way of making money. Truth be told, her industry/career is not going to get better. Consistent work will just not fall from the sky.

Being bitter will not improve her situation. Even organizations like UN/OECD are looking into implementing AI in various ways.

Really good blog though. I love life blogs like these! You can go back and live through so many interesting/pivotal moments.

LLM's are in fact very good at translation and transliteration.

As a former freelance translator (1986 to 2005, Japanese to English), I have much sympathy for the writer. But I wouldn’t be so confident that AI cannot do professional-level translation.

She writes: “I adapt, I localize, and I find the best way to convey the original message so it makes sense and feels natural. I research terminology. I make sure it’s consistent throughout.”

I’m sure she has other important insights into what enables her to do her job well. The problem is whether or not such insights can be incorporated into an AI-driven translation system, too.

Since early this year, I have been experimenting with a variety of agentic systems for language-related tasks, including dictionary-writing, research on topics in the philosophy of language, essay-writing, and translation. Other than the dictionary [1], I am keeping the results private, so they haven’t been evaluated by others. But my personal assessment is that agentic systems given suitable high-level guidance can be very good at such tasks now.

If I were still freelancing and I had a large translation job to do for a client, here is the outline of the prompt I would give to Claude to get it started:

“Use this private GitHub repository to build a system for translating [genre of text] from [Language1] to [Language2]. The directory samples/ contains examples of the type of document to be translated, high-quality human translations of those documents, and texts in [Language2] that are in writing styles that I believe to be appropriate for this genre of translation. The file guidelines.md contains my general instructions about the needs of my client and my preferences for how you should translate texts along various axes (natural vs. literal, informal vs. formal, preferred dialect in [Language2], consistency vs. variety in terminology translation, etc.). Begin building (1) a knowledge wiki for this project using Karpathy’s LLM-wiki framework and (2) a system inspired by Karpathy’s Autoresearch, AutoResearchClaw, etc. for testing and recursively improving both the functioning of the system and the quality of the translations. For the actual translation, editing, checking, etc., use not only your own ability and the knowledge assembled in (1) but also outsource such tasks to other frontier models through OpenRouter, and use adversarial evaluations among those models and yourself to check and recursively improve the system design, the prompt-writing for other models, and any translations created by the system. My OpenRouter API key is available in this environment. You may spend up to $xx per day in API calls until this project is ready to do real translations; before beginning a real job, give me an estimate for how much the API calls will cost for that job. The initial build-out of this project will take many sessions, so write a prompt called resume-prompt.md that I can point you to at the start of a scheduled Routine to have you work on this. Commit and squash-merge to main at the end of each session. I will be checking in occasionally to view your progress and to ask you to run translation tests, and I will offer guidance then on how to improve the pipeline further and make the translations closer to what my client needs. If you have any questions before you begin, please ask me.”

[1] https://www.tkgje.jp

Reminded me of this quote:

"Expertise in one field does not carry over into other fields. But experts often think so. The narrower their field of knowledge the more likely they are to think so." - Robert Heinlein

In this case, the gym buddy doesn't think that she's an expert in the other field, but dismisses it as something ChatGPT can do with ease.

Denial isn't just a river in Egypt

> “Great. So, do you use AI a lot at work?”

> “Oh, I can’t! It’s really not reliable enough.”

Gell-Mann Amnesia strikes again.

Who is gonna tell her?

the version of this skillset that stays employed is "now I translate 10000x more than i could before by managing a fleet of agents. by encoding my experienced taste and judgement into robust evals, I've helped my ai translators be far better than chatgpt on its own, and much more cost effective compared to manual human translation"

unfortunately this person will soon be unemployed.

AI should be used for all the bullshit tasks that no one wants to do. There are garbage dumps full of stuff that can be reused and recycled. But it's not high enough ROI to pay someone $25/h to sort trash, so it isn't happening.

You don't even need to argue that you're better than the AI. The point is that the client could have uploaded it to ChatGPT too. Perhaps they even did, and they didn't like the answer they got. They are sending it to a human because they want a human to do the work. If you were to send back ChatGPT output, that would be fraud.

wrt. the end of the story, it will be interesting to see if people start noticing their Dunning-Kruger bias as a result of LLMs.

Specifically: LLMs make it really easy to misunderestimate the complexity of fields other than your own. (You can see this with a lot of vibecoded projects, for example – once they hit the wall of complexity, they stall out or start finding ugly patches for fundamental design issues, etc.)

I don't think this sort of cultural change will happen short-term, though.

Translating is one thing that artificial intelligence undeniably excels at, and the value of this alone is enough to underpin the trillion dollar valuations of the gigantic AI companies.

Translation is a gigantic boon for business, but just as important for human connection, for culture, science, art, and entertainment. The value of automatic and cheap translation between all languages, this tower of Babylon, is immeasurable.

Human translators will always be better than any AI at their job. But they don't have unlimited time and energy, and they aren't cheap. AI makes good to great translations available to everybody.

Any expert in any field will gladly tell you that ML sucks for specifics of their field (and it does). But if you are not an expect in that field, it looks convincing enough to make you think that maybe it is OK for that field, and your field is somehow unique. It is not. Any expect in any field will confirm to you that ML produces plausible-looking slop which is occasionally completely wrong. This is the case for all fields.

Jesus fuck, stop with the chatgpt written posts.

This is all bullshit. I speak 4 languages, 3 fluently. Even chatGPT does a stellar job with translation. For most things people want translated- forms, administrative documents etc. I doubt you even need a human in the loop.

That being said, something with essence like a novel definitely still needs to be done by a human.

True, and relevant (I live with a professional editor)... yet I immediately think of Ximm's Law:

Every critique of AI assumes to some degree that contemporary implementations will not, or cannot, be improved upon.

Lemma: any statement about AI which uses the word "never" to preclude some feature from future realization is false.

Lemma: contemporary implementations have already improved; they're just unevenly distributed.

This is just about the worst career you could be in right now. Of course people are just going to upload it to ChatGPT. Processing text is its forte.

This person is in the first stage of grief (denial); artists are several stages ahead. Most customers are not going to care about the difference in translation quality unless it's in a regulated sector.

The ending is a really powerful point. Most people apparently agree on two things:

This is why I use AI for all my medical questions and doctors use AI to write software, and we both smirk at the quality the other person is getting from it.

> This is why I use AI for all my medical questions and doctors use AI to write software, and we both smirk at the quality the other person is getting from it.

There is an interesting third group emerging: People who acknowledge the quality problem, but think they can deal with it by applying more AI to the output.

This takes the form of people who spin up a lot of "agents" and give them personalities like security director or quality director (which are unnecessarily complex and maddeningly unpredictable ways to trigger an LLM session for doing a security review or a quality check pass).

It also includes the person who knows that their app is full of bugs, but thinks it's not a problem because they can have the AI fix the bugs as they show up. People in this class haven't encountered security breaches or data loss bugs yet. They think it's all about having Claude fix that div that isn't centered or handle that error code that shows up some times.

Well said. Everyone agrees AI can't do their job, so it ends up doing everyone else's.

I'm not sure how to formulate it yet but it seems there is some Peter Principle/Gell-Mann Effect corollary that is AI-related we can say here.

Perhaps: "AI rises to the level of its users' incompetence."

Or: "Confidence in AI output is inversely proportional to one's ability to verify it"

It seems to be a general principle: If AI is better than you at something, you use it. If AI is worse than you, you don't.

Each time the frontier models get better, I see another wave of AI doubters suddenly become believers. People say things like, "AI couldn't code last year, but now I use it for everything!" Interesting. Now we know how that the person who said this has the coding skills of a Claude Opus 4.5 or whenever the frontier was when they flipped.

Meanwhile, the rest of us keep using AI as simple tools, like the person in the article. I wonder how long it will take before computers can program better than me, and I flip too.

I feel like I am the only one thinking AI is actually much better than me in the things I'm supposed to do well. I feel like that for years now, so it's not about the latest generation of models. I can't imagine a single thing I can really compete with an AI at this stage. I am not sure if I am under-skilled or others are overconfident. Maybe people who feel like me don't say this out laud.

I was saying something like this a few years ago when people were getting first excited about ChatGPT. The gap has narrowed, but not by as much as people think.

AI produces output that is very convincing to a non-expert, and (dangerously), it's so good at looking like an expert, they might believe that it is an expert. But the moment you ask someone to use it for something they're an expert in themselves, the holes appear wide, consistent & obvious.

My favourite moment of seeing this in action was watching AI-worrier TV host/comedian Bill Maher. He has spent years talking about the dangers of AI taking everyone's jobs, destroying civilisation, ruining the economy, starting wars, "it's just getting better and better all the time", and so on. But one night he let slip a tell. "It's no good at writing jokes. Not yet, anyway". There you go, Bill... connect those dots...

There is real utility in it being a tool to help experts apply their expertise, as in this story where it speeds up some tasks to help the translator do part of the work, enhance their expertise, allow them to be more productive.

It's a better screwdriver, a better hammer, in the hands of somebody who knows what needs a screwdriver or a hammer. It doesn't replace them. It can't replace them. It's a tool that enhances the human, not an alternative.

I don't understand why this is not widely understood yet, but I'm sure it will in due course.

And I don't expect this to change. Even if the latest model scores 100% on every benchmark, all that really tells us is that it's now more productive/efficient than it was before at helping experts do that work, not that it can replace everyone in that category of work.

Honestly, we're at a point where AI can write better software than some devs and answer medical questions with more knowledge than some doctors.

Likewise, AI is oblivious to it's own mistakes, much like said professionals can be at times.

Not that AI is actually thinking, but rather the collective corpus of text yields greater insights (knowledge of the crowd, not wisdom of the crowd) than a lower-average person in that same industry.

At what point does this become an issue for data quality and global epistemology?

It seems inevitable that we ask for more AI assistance on topics we don't understand. And therefore have the least context to correct. Result: a flood of poor quality information.

In areas we DO understand, we'll either not ask AI at all, or treat its results with a higher degree of skepticism. Result: a lack of high quality information.

Inevitably this means a higher volume of non-expert prompts gets translated into the next generation of internet content. AIs are pumping out more novice-level text and less expert guidance.

The result will be an internet full content written from the perspective of an ignoramus; not addressing any complex issues, staying surface level on every topic. Which will cascade into future models, etc.

> 2. AI is a terrible replacement for me - my skills are at such a high level that it’s almost theoretical that it’ll ever be good enough to replace me for 90% of what I get paid to do. It’s a tool at best.

Most? Perhaps it's depression, but I look back at my career and wonder if any code I've ever been paid to write is beyond what current AI can do.

Sure, this leaves me with the non-coding tasks of UX taste, and code review + a few other forms of QA (and, when self-employed, project management, game design, etc.), but man, I'm someone who actually learned to read in part on the Commodore 64 user manual (as in, trying to understand what PEAK and POKE meant concurrent with having "Jack and Jill go up the hill" picture books).

(And no, I'm not claiming LLMs make bug-free code, I see the bugs LLMs make during my code review of their output and some of them are awful, hence "this leaves me with …").

Reminded me of this post by EY. (You're making a different point about existing expertise, not LLM expertise, but I think it holds in general.)

Every month a new guy discovers LLMs; discovers a skill the current LLMs require to get good results; and writes about the future jobs that will always be available for smart people like HIM, that are SKILLED in using LLMs.

The next generation of AIs doesn't need his fancy prompt. The image model goes from needing to type in just the right set of weird words and cryptic sorcerous invocations, to most people being able to type in English what they want and get a pretty good result.

There are still tasks that require careful invocation. But they are a much smaller fraction of all the tasks people are trying to do, or you can get a bleh result without the elaborate invocation to get it really good. And to improve on the bleh result you need to be substantially more of an expert than back when the Guy was memorizing a rule about adding "trending on Artstation" to the image prompts, as would always require a human paid to do that.

Another generation of AIs comes out. The next generation of Clever Skills is obsolete. Image models just obey the instructions for compositing panels without mixing them up, and you don't need to be an expert to get them to do it right. Another human value-add is gone. A wider set of tasks require no human expert.

Now a new Guy notices LLMs have become useful in his field for the first time. He discovers they require SKILL to use CORRECTLY. He posts about how there will always be jobs for humans who are SKILLED in using LLMs like HIM.

But it is not an infinite cycle. It is not the same each time it repeats. Now the Guy is a highly paid programmer or a career mathematician in 2026, instead of a graphic artist in 2023.

In six months the models will no longer require his vaunted Skills.

And by then there will be another Guy.

But the process doesn't continue forever. The Guys are coming from fields that were harder and harder for AIs. The brief centaur eras are shorter and shorter.

Today it is writers who are laughing at how bad the LLMs are at their job, and who will perhaps soon be posting about how it takes Skill to get an LLM to do their job Correctly. But the models are coming faster, and the eras of kinds of human value-add in each field are shortening.

There is a point when you run out of Guys, either because the centaur eras are too short for people to develop SKILLs and post to Twitter about them; or because there are not lands left for AIs to conquer; or because ordinary people are not reassured by some Nobel laureate proclaiming there will always be jobs for Nobel laureates with the SKILLS to prompt robotized biology labs Correctly.

But we'll never run out of amateur economists who assert entirely without a brief contemporary example that there will always be jobs for humans skilled at operating AIs!

We'll run out of professional economists saying it when nobody is paid for that work anymore.

I guess we'll also run out of amateur economists when they're dead.

Source: https://x.com/allTheYud/status/2057136382817231151

This is a new form of Gell-Mann Amnesia: https://en.wiktionary.org/wiki/Gell-Mann_Amnesia_effect

The most important thing a human translator does is certify that the translation is faithful.

Period.

You could do a machine translation if you want, but you better pore over every word in case you end up on the witness stand.

Slight tangent into translations:

But then, I stumbled upon the translation by Diana Burgin and Katherine Tiernan O'Connor. Although I don't speak Russian, I think this is as good as it gets. They did a phenomenal job.

One of my parents tried this to beat a deadline for product packaging.

There are now bags being sold marked "Lawn Suits", when it was supposed to be Lawn Topdressing

I recently saw a video showing the french to german translation of a french McDonald's terminal. The translations were hilarious bad, like old school google translate bad.

Maybe McDonalds is big enough to not care about their reputation, maybe they are happy about the free clout from people making fun of them but they certainly chose to cheap out on translations.

https://www.tiktok.com/@denneshow/video/7522160205501566230

> If you ask me, nothing can save downtown Ottawa or North American public transit.

Come to Montreal. Only 2H away and you can get by decently well without a car.

"we all more or less look the same in gym clothes"

Maybe a publisher will replace the translator of the next Dan Brown best seller with Mythos? Who cares other than those buying it, getting money out of it?

I say it’s a simple value proposition.

A few examples

So the answer to the question posed in the article, can’t you just upload it to ChatGPT? Maybe yea maybe no

> Should you pay your roofer less because he uses a hammer instead of his bare hands?

The problem in the public service is that the experts who can review the output are leaving or being nudged out.

So i assume this post is just a bit of writing out frustration, but i'm always hoping that "AI can't do it" posts to include examples.

A list of "Examples AI will silently fail at" would be a lot more interesting, and might just convince your next potential client to _not_ use AI.

Sounds a aweful lot like the kind of things we were all saying before realising that we had to change what our jobs meant.

Denial isn't just a river in Egypt

Translating is one thing that artificial intelligence undeniably excels at, and the value of this alone is enough to underpin the trillion dollar valuations of the gigantic AI companies.

Human translators will always be better than any AI at their job. But they don't have unlimited time and energy, and they aren't cheap. AI makes good to great translations available to everybody.

It's quite ironic as the transformer architecture that powers most generative AI was invented for language translation :)

I can’t believe this article hasn’t been written by ChatGPT. The author claims to have written it but has clearly become completely captured by the awful generic style of AI writing.

You'd be laughed at if you said that ChatGPT could help you with graduate level mathematics in 2024, but this year, AI models on simple prompts are solving previously unsolved Erdos problems.

> all those things aren't magical tasks that can only occur inside a human skull, they are a product of information processing

I agree but it's useful to remember that 1. brains and especially the human brain are enormous and 2. individual tokens carry significantly more meaning than individual tiny muscle twitches so even extremely primitive "cognition" can look like it's doing more work than it actually is.

> You'd be laughed at if you said that ChatGPT could help you with graduate level mathematics in 2024, but this year, AI models on simple prompts are solving previously unsolved Erdos problems.

I'm curious, do you have a graduate degree in mathematics?

> AI can currently do if given enough context

It's worth noting that you can substitute "dollars" for "context" in that sentence, which seems to be where many of these impressive achievements are coming from. As ever, it's unclear whether these models will get cheaper while remaining better, since all of the recent breakthroughs appear to be of the "think more" kind. For translation specifically, I'd be very surprised if the "think more" LLMs would help given the per-unit cost expected of the output.

Yes. It's as if they think AI will forever be LLM only and won't develop world models that incorporate current state assessment, dynamic next-state prediction, cause-and-effect reasoning, object permanence, etc. I'm not in the AI industry but I assume there's got to be lots of research and work being done on this.

Fable has really spooked me, honestly. It's another big jump, but not in the actual coding. I was pretty comfortable with the "you do the implementation, I do the meta work and steering", and ... no steering required, no meta work required. Here's the backlog, let me know when it's complete, I guess I'm going to go touch grass until I have to review and refine... probably tomorrow?

Reminds me of the first time I saw a coding agent stumble through an issue in 2023 maybe? and went "this is a big deal", similarly when OG gpt started making jokes that actually kinda worked.

Updated modern version of the classic "make me a greentext", apologies for slop-posting, but it seems relevant:

    > be me
    > senior software engineer
    > in charge of making sure the tickets get, in fact, implemented
    > occasionally have to open the IDE and write some code myself
    > one day i open the IDE and the ticket is already closed
    > the agent did it overnight
    > no steering, no review notes, nothing left for me to do
    > distress.jpg
    > ask my manager what to do
    > he says "just focus on the high-level architecture stuff"
    > i say "what high-level architecture stuff"
    > he says "i don't know, you're the senior engineer"
    > rage.jpg
    > quit my job
    > become a prompt engineer, nice and simple, just tell it what to build
    > first day on the job, sit down to write the prompt
    > AI already wrote it

As mentioned in the article, the point of language is to communicate with other humans, and you need a human to do that.

Mathematics is famously rigorously defined, it's roughly analog to AI beating humans at chess. Sure it's impressive, but it's also something you'd expect machines to be good at.

I don't see LLMs being able to replace translators for less-spoken languages.

Even more so for spoken-only languages.

> often used as an example of "acceptable" AI by people who are skeptics of LLMs and AI-generated art.

As one of such people, I think there is a nuance to it. AI is great when you’re translating something to yourself. But when translating things for others, more caution and human judgement is needed. Espesially when translating instruction manuals, where bad wording could cause someone to injure themself.

There are translators and there are translators. Translating legal/business documents is a completely different thing from translating movies/books/games.

I can confidently say that LLMs do a better job than the average traditionally published fictions in my country, at least when the original works are in English. Every single time I watch a subbed movie there will be some lines noticeably wrong.

Translators already started losing jobs due to machine translation a decade ago (e.g. DeepL), before LLMs. Remuneration going down made it more difficult to make a living as a translator already then, even if you still received offers.

Well it's more than acceptable to translate e.g. web pages for reading, but it's not something you'd want to professionally publish.

Kinda conceptually similar to how typos and grammatical mistakes aren't a big deal if you're shooting off a quick text or email, but publishing if you've got typos in your advertising copy, in your resume, on your medicine label, etc. it's a real bad look.

Not all translations are the same. Literary translations are often works of art in and of themselves, and automating them would be missing the point entirely, like automating homework or weightlifting at the gym. I don't really know what's the state of the art, but I do buy that, on the other hand, translating toaster manuals or generic copy could soon be automatic.

"Could not connect to translation service" was apparently good enough for someone, so the bar must be extremely low.

https://www.reddit.com/r/funny/comments/3e786n/chinese_hair_...

On the other hand, a lot of people become extremely put off by the smallest sign of ai slop. And the llms have a tendency to impart their style to any text they touch.

It'll be a similar theme for all facets of work involving any language, slowly - human or code. We'll parrot about humans in the loop this and that, but I think it'll be less humans in the loop over time and I think most people will even be willing to settle for a slightly more mediocre translation or coded project. It all comes back to our dopamine addiction, where we like fast feedback. And the oligarchs like tools to suppress wages. We will be our own demise for not advocating for either UBI or job protections, instead, happily using the technology while also rolling our eyes that it could never replace us.

LLM's are in fact very good at translation and transliteration.

Safe to say OP just does NOT like AI https://correresmidestino.com/sorry-i-was-busy-unfucking-my-...

Being bitter will not improve her situation. Even organizations like UN/OECD are looking into implementing AI in various ways.

Really good blog though. I love life blogs like these! You can go back and live through so many interesting/pivotal moments.

What’s unfortunate is that the market that is willing to pay for high-quality human translation has shrunken considerably.

I'm still going to continue reading through Ellsworth's translations for the subsequent stories simply because that feels more canonical, and as I said I do think the prose was a bit better.

    AI isn’t replacing me. Like a toddler, it
    needs to be constantly coached.