Is PettiChat using ChatGPT?

No. PettiChat uses Alibaba's Qwen model family, which is a separate LLM developed in China. The architecture is similar to GPT — they're both transformer-based language models — but they're trained by different companies on different data.

Does Qwen actually understand what the dog is saying?

Qwen never receives the bark. It receives a label from the on-device classifier (e.g., 'anxious') plus context, and writes a sentence that fits. The understanding-of-the-bark, such as it is, happens earlier in the pipeline at the classifier step.

Why doesn't PettiChat just run the LLM on the device?

Current pet collars don't have the compute to run a frontier LLM locally. Even small on-device LLMs would be slower and lower-quality than calling the cloud. Battery cost would also be significant.

Could a US company build the same product with GPT or Claude?

Architecturally, yes. The hard part isn't the LLM — those APIs are widely available. The hard part is the on-device classifier and the training data. Without the labeled pet-vocalization corpus, a US version would either need to license one or build one from scratch.

Pet Translation Technology

What Alibaba's Qwen Model Actually Does for PettiChat — A Plain-English Explainer

PettiChat sends your dog's bark to Alibaba's Qwen LLM. What's actually happening in that round-trip — classification vs generation vs the 94.6% number.

The editorial team

Published

May 31, 2026

Read

11 min read

PettiChat's marketing tells you the dog barks, the collar listens, and Alibaba's Qwen model translates it into English. Which is true, in the same way that "you press a button and your phone takes a picture" is true. There's an enormous amount of work hiding behind that sentence, and the work is where the honest read on this product lives.

If you've read our piece on the 2026 AI pet collar landscape or the data-business angle, this is the technical companion. The goal is to give you enough understanding of the pipeline to evaluate the claims yourself.

The 30-second version

Here is what happens, in order, when your dog barks while wearing the Chinese PettiChat:

The collar's microphone records the bark and the surrounding motion (from a small IMU sensor).
On-device hardware classifies the bark into one of several pre-defined emotional or behavioral categories (similar to what Petpuls does, but with more inputs).
The classification result — plus contextual metadata (time of day, recent activity, repeating patterns) — gets sent over Bluetooth/Wi-Fi to your phone, and from your phone to Alibaba Cloud.
On Alibaba Cloud, Qwen (a large language model) receives the classification + context as a structured prompt and returns a natural-sounding sentence.
The sentence appears in your app, attributed to your dog.

Two things to notice. First, the model is not "decoding the bark" in any literal sense — that step is the classification, which happens before Qwen sees anything. Second, the "translation" you read is generated by a language model, not transcribed. The model is doing the same kind of work it does when ChatGPT writes you an email: producing plausible text that fits the inputs.

What Qwen is, briefly

Qwen is Alibaba's family of large language models. It is to Alibaba what GPT is to OpenAI, Claude is to Anthropic, and Gemini is to Google. The models are trained on broad text (web pages, books, code, conversations), they accept text as input, and they produce text as output.

The version PettiChat appears to be using is Qwen-2 or Qwen-2.5, which Alibaba released as open-weights through the Qwen GitHub project and offers as a paid API through Alibaba Cloud's Tongyi Qianwen service. The model performs competitively with the leading Chinese-language models and is comparable to mid-tier US frontier models on most benchmarks.

That's all you need to know about Qwen specifically. It's a capable LLM and Alibaba hosts the inference. The collar isn't doing anything exotic with the LLM side — it's using a normal commercial LLM API.

The classification step is where the real work happens

The interesting (and harder) part of the pipeline is what happens before Qwen ever sees the data: the classification of the dog's bark into a category.

The 36Kr interview with PettiChat's founder lists the training corpus as approximately 890,000 cat samples and 650,000 dog samples, reviewed by what the company describes as "experts." That dataset, if real and well-labeled, would be one of the larger published pet-vocalization corpora in the world.

What that data is for: training a classifier to take a chunk of audio (plus accelerometer/gyroscope data from the collar's IMU) and assign it to a small number of categories. The exact category taxonomy isn't published, but based on similar systems and the company's marketing, it likely includes:

Happy / playful / excited
Anxious / distressed
Bored / lonely
Hungry / thirsty
Demanding attention
Reactive (territorial barking)
Pain / discomfort

The classifier output is something like a probability distribution over these categories — "62% anxious, 23% bored, 15% other," plus a confidence score and contextual signals (motion data, time since last interaction, etc.).

This is honest, normal machine learning. It is the same kind of work Petpuls does, but with a larger dataset and additional input modalities. The 94.6% accuracy figure almost certainly refers to performance on this classification step, not on the literal "translation" step. As The Underbite documented in covering the Traini Kickstarter, that 94.6% number predates the LLM layer.

Why this distinction matters

"94.6% accuracy on bark classification" is a falsifiable, testable claim. You can hold out a set of labeled barks, run them through the classifier, and check whether the output matches the labels. We have no independent verification of PettiChat's specific number, but the kind of claim it is — a classification accuracy — is the kind of claim that can be true.

"94.6% accuracy on translating bark to English sentence" is not a testable claim, because there is no ground truth for what the dog actually meant. The classifier might be 94.6% accurate at deciding the dog is "happy," but the sentence Qwen produces — "I missed you so much! Can we go play?" — is not a translation in any verifiable sense. It is a plausible English caption that fits a happy dog.

Both can be useful. People enjoy reading what their dog is "saying" even when they intellectually know it's generated. The objection isn't that the experience is bad. The objection is that the marketing slides past this distinction, and most readers walk away thinking the LLM is decoding rather than narrating.

Where the LLM does add real value

To be fair to the design, Qwen isn't just there for flavor. The LLM layer does a few things that pure classification can't:

Context blending. A bark at 3 a.m. after the dog just came back from a walk is different from a bark at 8 a.m. when no one has fed the dog yet. The LLM gets the time, the activity history, and the classification together, and produces output that reflects all of it.

Multi-turn coherence. If your dog "says" "I'm hungry" and you don't feed it, the next bark's caption can reference that. The LLM keeps short-term state across recent interactions. (Whether your dog has that kind of continuity in its actual experience is a separate question.)

Natural-language fluency. A classifier output of "65% hungry, 20% bored" is not a UX. "I'm getting hungry over here — can we eat?" is a UX. The LLM converts machine output into human-readable text.

Multilingual output. Qwen handles Chinese, English, Japanese, and others reasonably well. The same classifier output can drive captions in any of these languages.

None of these are translation in the technical sense. They're sensible UX choices that make the product feel alive. Whether that experience is worth $118 is a separate judgment call.

How this compares to other approaches

We compared the broad architectures across the major AI pet products in the 2026 landscape piece. Briefly, the architectures available right now are:

Product	Approach	LLM involved?
Petpuls	On-device classification → category label	No
FluentPet Connect	Button-press logging + sequence pattern matching	No (some ML for sequences)
PettiChat (Meng Xiaoyi)	Classification + cloud LLM for caption generation	Yes (Qwen)
PettiChat (Traini)	Classification + cloud LLM (PETTI model) for two-way mode	Yes (PETTI)
MeowTalk	Phone-only audio classification → category label	No
Sentra	Health/activity ML + alerts	Minimal

The LLM-on-top approach is novel, but it doesn't change the underlying limitation: the classifier on the device still has to do the actual work of distinguishing what the animal is communicating. The LLM only changes how the result is presented.

This is why we keep stressing that "94.6% accuracy" lives at the classification step. If you want to evaluate whether PettiChat is "real AI" or "marketing AI," that classifier's quality is the thing that matters. The Qwen layer is genuinely good language model work — that's not in dispute — but it's not where the unique-to-pets value lives.

What we don't know yet

Several open questions a serious buyer should keep in mind:

The labeling protocol. "Reviewed by experts" is vague. Was the labeling done by veterinarians? Behaviorists? Was inter-rater reliability measured? Without published methodology, the 94.6% figure is hard to evaluate.

The benchmark set. Was the 94.6% accuracy measured on held-out test data the model had never seen? Was it measured cross-breed (a model trained on Beijing dogs probably classifies Beijing dogs better than Texas dogs)? The published materials don't say.

Update cadence. LLMs improve. Will Qwen-3 give us better captions? Will the classifier model be retrained as more user data flows in? Both are likely. Neither is committed to.

Privacy of the audio stream. Audio is being uploaded to Alibaba Cloud. We have a separate piece on the privacy implications — short version, the privacy posture of a product depends a lot on which version of PettiChat you actually receive and which country's law applies.

The bottom line

PettiChat's pipeline is a perfectly reasonable application of modern machine learning. The on-device classifier is doing real work. The cloud LLM is making the output legible and emotionally satisfying. The headline accuracy number is genuine, just not measuring what casual readers think it's measuring.

If you understand what you're buying — an emotional-state classifier with a sophisticated text-generation UX — the product can be a delight. If you think you're buying literal animal translation, you'll be disappointed.

We think the honest framing is: PettiChat is the first consumer product to combine pet vocalization classification with an LLM-driven natural language interface. That is a real product novelty. It is not a translator. The distinction matters for how you should weight reviews, marketing claims, and your own expectations.

Sources

The technical claims in this piece come from:

Spend $118 on an AI Pet Collar Just to Hear Your Cat Say 'I Miss You' · 36Kr Europe · 2026-05-25The 36Kr feature this piece quotes — training corpus size (~890k cat samples + ~650k dog samples) and the Qwen integration details.
Qwen — Alibaba Cloud documentation · Alibaba CloudOfficial model-family documentation from the Qwen Team at Alibaba Group.
QwenLM organization on GitHub · GitHubOfficial open-source repositories for the Qwen model family.
Pet AI Startup Claims 'Real-Time' Pet Translation. Here's What That Actually Means. · The Underbite · 2026-04-20Coverage and skepticism on the Traini PettiChat Kickstarter campaign.
Petpuls' Smart Collar Claims to Distinguish Your Dog's Bark · Digital Trends · 2021-01-13Petpuls' Seoul National University testing methodology — five emotional categories at ~80–90% accuracy.
Published model papers on LLM architecturesGeneral field reference (transformer LLM papers from OpenAI, Google, Meta, Alibaba) — no single canonical URL.unlinked

We have not independently verified PettiChat's accuracy numbers. Where claims are contested or unverified, we've said so.

Frequently asked

Is PettiChat using ChatGPT?: No. PettiChat uses Alibaba's Qwen model family, which is a separate LLM developed in China. The architecture is similar to GPT — they're both transformer-based language models — but they're trained by different companies on different data.
Does Qwen actually understand what the dog is saying?: Qwen never receives the bark. It receives a label from the on-device classifier (e.g., 'anxious') plus context, and writes a sentence that fits. The understanding-of-the-bark, such as it is, happens earlier in the pipeline at the classifier step.
Why doesn't PettiChat just run the LLM on the device?: Current pet collars don't have the compute to run a frontier LLM locally. Even small on-device LLMs would be slower and lower-quality than calling the cloud. Battery cost would also be significant.
Could a US company build the same product with GPT or Claude?: Architecturally, yes. The hard part isn't the LLM — those APIs are widely available. The hard part is the on-device classifier and the training data. Without the labeled pet-vocalization corpus, a US version would either need to license one or build one from scratch.

Continue reading

AI Pet Collars