Discussion about this post

User's avatar
Yasser Tamer Atef's avatar

Thank you for this piece, Kaylie.

You’ve managed to navigate a terrain that most commentary on assistive tech tends to flatten or evade entirely. The neutrality you maintain isn’t a lack of position; it’s a form of rhetorical clarity—an insistence that nuance can still speak truth, even (or especially) when the systems we’re assessing are built on asymmetries of power and access.

I want to speak briefly from my own context, which exists outside the dominant geographies where these tools are built, tested, and mythologized.

For example, conversational AI (Gemini in particular) reveals a striking unfamiliarity with cultural referents, political conditions, and linguistic nuance that originate beyond the Global North. The bias isn’t just embedded in data sets. It appears in the assumptions about what counts as intelligible input, what kinds of knowledge matter, and which users are worth designing for.

You noted its failures in pattern matching and evidence gathering. I’d go further: it performs a kind of epistemic triage. It sorts what it understands from what it dismisses, and in that sorting, entire histories, dialects, and lived experiences are misclassified as “errors” or “hallucinations.”

Apps like BeMyEyes offer a compelling use case. However, even here, locality becomes a determining factor. I’ve noticed that user pairings tend to cluster regionally. While this may increase relevance, it also reveals how global inclusion is often restricted by infrastructure and linguistic segmentation. It’s not necessarily a flaw in the design, but it does reinforce how “global” platforms often become regionally siloed in practice.

Here’s a more direct example: if I didn’t speak English fluently, I wouldn’t have been able to access something as seemingly inclusive as the Microsoft Disability Answer Desk. That’s not simply a technological limitation. It reflects a systemic failure in multilingual accessibility—one that conversational AI currently reproduces rather than resolves.

In the case of Gemini, it often renders other languages (Arabic as an example) nearly unintelligible unless explicitly prompted to use a specific variety—Egyptian colloquial Arabic, Gulf Arabic, and so on.

Even then, the result is inconsistent at best. What emerges is not a translation, but a disjointed linguistic approximation that occasionally slips into the absurd. It’s less assistance and more mimicry of language diversity, without genuine understanding.

Regarding document remediation, we are still far from where we need to be. Optical character recognition is no longer the main hurdle. The greater challenge is structuring inaccessible content into usable, navigable formats: complete with headings, sections, and semantic landmarks. To date, I haven’t found an AI model (ChatGPT, DeepSeek, or otherwise) that can perform this task reliably without an exhausting amount of manual prompting. By the time the output becomes usable, I could have already completed the task using Kurzweil or OpenBook.

These aren’t fringe critiques. They are central to the question of what it means for assistive technology to assist. If it doesn't work equitably, then it reproduces the very exclusions it claims to address.

Your writing doesn’t fall into this trap. Instead, it opens space for the deeper conversation that is so often missing—where access is not just a matter of interface, but of justice.

I’ve been commenting a lot (perhaps too frequently) but only because your posts make room for these thoughts. If my engagement ever feels overwhelming, I’m happy to take a step back. But if you're ever up for a deeper conversation, or if you need collaboration or support in any way, I’d be glad to connect.

Expand full comment
Kaylie L. Fox's avatar

What an excellent reply. Thank you so very much for your kind words. And believe me, you are not a bother in the slightest. I actually kind of wanted to delve into research around linguistics and AI, but I am unfortunately a plevian who only knows English as a native language and what languages I picked up here and there is hardly even usable. So I feel like I wouldn't be the person to properly dive into that topic in particular. But, that said, you do raise an interesting point. These models are being developed with the exception of deepseaq and English-speaking territories. Thus, it stands to reason that though there may be training data processing that covers other languages, it's not being intrinsically trained on those languages to be able to handle real actual conversation. You're not the first one to tell me about the linguist Xperia and Jim and I or even in other models. Would love to see a study done on this. Please feel free to reach out on Facebook and request me as a friend or message me. I would be delighted to talk with you more. We seem to share a lot. Ideological agreements. I will also say that from the position of writing these articles, the one thing that I strive to avoid is either oversimplification or the trap of playing up the hype of a product. I don't want to be one of those people that comes along and says AI is going to cure everything, because if you've read some of my other research, AI isn't going to fix all the systemic issues that we have before us, but in fact perpetuates them in some areas. Areas. I don't want to be one of those pro AI types that refuses to see the authentic and true challenges in front of us, especially in a disability field, as there is already so much careless talking going on. They completely undermines the point of inclusion and making sure that everybody's needs are met. Met. AI offers us a huge tool to help bridge a lot of of accessibility needs, but the technology is still in its infancy, despite how technically far we've come. I tried to remember that when I write things like this because it's honestly important to repeat it from time to time. These machines are not fallible even if they can generate good content. They're not right because they are trained on so much data. And they certainly aren't evolved enough to be able to demonstrate the ability to intrinsically pick up on a user's needs without explicitly being directed. Until we get to that point, I'm going to maintain my realistic approach to writing these things. Thank you again so much for your kind words, and if this message comes out disjointed just know it's because I'm using dictation right now and not typing. LOL as always, it's a pleasure engaging with you.

Expand full comment

No posts