Constructing an enterprise AI firm on a "basis of shifting sand" is the central problem for founders at the moment, based on the management at Palona AI.
At the moment, the Palo Alto-based startup—led by former Google and Meta engineering veterans—is making a decisive vertical push into the restaurant and hospitality house with at the moment's launch of Palona Imaginative and prescient and Palona Workflow.
The brand new choices rework the corporate’s multimodal agent suite right into a real-time working system for restaurant operations — spanning cameras, calls, conversations, and coordinated process execution.
The information marks a strategic pivot from the corporate’s debut in early 2025, when it first emerged with $10 million in seed funding to construct emotionally clever gross sales brokers for broad direct-to-consumer enterprises.
Now, by narrowing its focus to a "multimodal native" method for eating places, Palona is offering a blueprint for AI builders on the way to transfer past "skinny wrappers" to construct deep methods that remedy high-stakes bodily world issues.
“You’re constructing an organization on high of a basis that’s sand—not quicksand, however shifting sand,” mentioned co-founder and CTO Tim Howes, referring to the instability of at the moment’s LLM ecosystem. “So we constructed an orchestration layer that lets us swap fashions on efficiency, fluency, and price.”
VentureBeat spoke with Howes and co-founder and CEO Maria Zhang in particular person just lately at — the place else? — a restaurant in NYC in regards to the technical challenges and arduous classes discovered from their launch, development, and pivot.
The New Providing: Imaginative and prescient and Workflow as a ‘Digital GM’
For the tip consumer—the restaurant proprietor or operator—Palona’s newest launch is designed to operate as an automatic "finest operations supervisor" that by no means sleeps.
Palona Imaginative and prescient makes use of in-store safety cameras to research operational indicators — comparable to queue lengths, desk turnover, prep bottlenecks, and cleanliness — with out requiring any new {hardware}.
It displays front-of-house metrics like queue lengths, desk turns, and cleanliness, whereas concurrently figuring out back-of-house points like prep slowdowns or station setup errors.
Palona Workflow enhances this by automating multi-step operational processes. This consists of managing catering orders, opening and shutting checklists, and meals prep achievement. By correlating video indicators from Imaginative and prescient with Level-of-Sale (POS) knowledge and staffing ranges, Workflow ensures constant execution throughout a number of areas.
“Palona Imaginative and prescient is like giving each location a digital GM,” mentioned Shaz Khan, founding father of Tono Pizzeria + Cheesesteaks, in a press launch supplied to VentureBeat. “It flags points earlier than they escalate and saves me hours each week.”
Going Vertical: Classes in Area Experience
Palona’s journey started with a star-studded roster. CEO Zhang beforehand served as VP of Engineering at Google and CTO of Tinder, whereas Co-founder Howes is the co-inventor of LDAP and a former Netscape CTO.
Regardless of this pedigree, the staff’s first 12 months was a lesson within the necessity of focus.
Initially, Palona served style and electronics manufacturers, creating "wizard" and "surfer dude" personalities to deal with gross sales. Nonetheless, the staff rapidly realized that the restaurant {industry} offered a singular, trillion-dollar alternative that was "surprisingly recession-proof" however "gobsmacked" by operational inefficiency.
"Recommendation to startup founders: don't go multi-industry," Zhang warned.
By verticalizing, Palona moved from being a "skinny" chat layer to constructing a "multi-sensory info pipeline" that processes imaginative and prescient, voice, and textual content in tandem.
That readability of focus opened entry to proprietary coaching knowledge (like prep playbooks and name transcripts) whereas avoiding generic knowledge scraping.
1. Constructing on ‘Shifting Sand’
To accommodate the fact of enterprise AI deployments in 2025 — with new, improved fashions popping out on an almost weekly foundation — Palona developed a patent-pending orchestration layer.
Somewhat than being "bundled" with a single supplier like OpenAI or Google, Palona’s structure permits them to swap fashions on a dime primarily based on efficiency and price.
They use a mixture of proprietary and open-source fashions, together with Gemini for pc imaginative and prescient benchmarks and particular language fashions for Spanish or Chinese language fluency.
For builders, the message is obvious: By no means let your product's core worth be a single-vendor dependency.
2. From Phrases to ‘World Fashions’
The launch of Palona Imaginative and prescient represents a shift from understanding phrases to understanding the bodily actuality of a kitchen.
Whereas many builders battle to sew separate APIs collectively, Palona’s new imaginative and prescient mannequin transforms present in-store cameras into operational assistants.
The system identifies "trigger and impact" in real-time—recognizing if a pizza is undercooked by its "pale beige" shade or alerting a supervisor if a show case is empty.
"In phrases, physics don't matter," Zhang defined. "However in actuality, I drop the telephone, it all the time goes down… we wish to actually work out what's occurring on this world of eating places".
3. The ‘Muffin’ Answer: Customized Reminiscence Structure
Some of the important technical hurdles Palona confronted was reminiscence administration. In a restaurant context, reminiscence is the distinction between a irritating interplay and a "magical" one the place the agent remembers a diner’s "traditional" order.
The staff initially utilized an unspecified open-source device, however discovered it produced errors 30% of the time. "I feel advisory builders all the time flip off reminiscence [on consumer AI products], as a result of that may assure to mess all the pieces up," Zhang cautioned.
To resolve this, Palona constructed Muffin, a proprietary reminiscence administration system named as a nod to internet "cookies". Not like customary vector-based approaches that battle with structured knowledge, Muffin is architected to deal with 4 distinct layers:
-
Structured Knowledge: Secure information like supply addresses or allergy info.
-
Gradual-changing Dimensions: Loyalty preferences and favourite objects.
-
Transient and Seasonal Recollections: Adapting to shifts like preferring chilly drinks in July versus sizzling cocoa in winter.
-
Regional Context: Defaults like time zones or language preferences.
The lesson for builders: If the perfect obtainable device isn't ok on your particular vertical, you have to be prepared to construct your personal.
4. Reliability by way of ‘GRACE’
In a kitchen, an AI error isn't only a typo; it’s a wasted order or a security danger. A latest incident at Stefanina’s Pizzeria in Missouri, the place an AI hallucinated pretend offers throughout a dinner rush, highlights how rapidly model belief can evaporate when safeguards are absent.
To forestall such chaos, Palona’s engineers observe its inside GRACE framework:
-
Guardrails: Arduous limits on agent habits to forestall unapproved promotions.
-
Pink Teaming: Proactive makes an attempt to "break" the AI and determine potential hallucination triggers.
-
App Sec: Lock down APIs and third-party integrations with TLS, tokenization, and assault prevention methods.
-
Compliance: Grounding each response in verified, vetted menu knowledge to make sure accuracy.
-
Escalation: Routing advanced interactions to a human supervisor earlier than a visitor receives misinformation.
This reliability is verified by way of huge simulation. "We simulated one million methods to order pizza," Zhang mentioned, utilizing one AI to behave as a buyer and one other to take the order, measuring accuracy to remove hallucinations.
The Backside Line
With the launch of Imaginative and prescient and Workflow, Palona is betting that the way forward for enterprise AI isn't in broad assistants, however in specialised "working methods" that may see, hear, and suppose inside a particular area.
In distinction to general-purpose AI brokers, Palona’s system is designed to execute restaurant workflows, not simply reply to queries — it's able to remembering clients, listening to them order their "traditional," and monitoring the restaurant operations to make sure they ship that buyer the meals based on their inside processes and pointers, flagging each time one thing goes flawed or crucially, is about to go flawed.
For Zhang, the objective is to let human operators give attention to their craft: "For those who've obtained that scrumptious meals nailed… we’ll inform you what to do."