- AI Models for Manufacturing: Factory operations demand both physical execution by robots performing tasks such as surface finishing, assembly, inspection, and reasoning in the form of planning, scheduling, health monitoring, and decision support. Powering these operations requires two complementary model classes that work together. Foundation models, trained on broad manufacturing knowledge and fine-tuned on domain-specific data, take a current state and objective as input and generate a probability distribution over possible actions. World models take a state and action pair as input and predict the resulting next state, enabling agents to predict the consequences of an action before committing to it. For physical agents, world models are grounded in physics: capturing how forces, contact, material properties, and geometry transform under a given robot action. For digital agents, world models capture operational dynamics such as how a scheduling decision propagates through a production line, or how a maintenance action affects downstream throughput. Foundation models interpret operational context and generate candidate actions. World models evaluate candidate actions by simulating their physical or operational consequences. These two models working in conjunction enable domain agents to reason about manufacturing decisions rather than simply pattern-match to prior experience. AI models provide the equivalent of super-specialized human expertise across every relevant manufacturing domain: depth that no single human can possess, made continuously available by FSI across the entire factory.
- Domain-specialist agents that deliver superhuman performance: Running a factory requires deep expertise across many simultaneous functions: industrial engineering, process engineering, mechanical design, robot cell operations, prognostics and health management, sustainment, and forward deployed support. Dedicated domain-specialist agents are needed for each of these functions, powered by the world models and foundation models described in AI Models for Manufacturing. These agents come in two forms: physical agents that directly interact with parts, executing surface finishing, assembly, and inspection with precision, payload, and sustained throughput no human operator can match across a full production run, and digital agents that handle planning, scheduling, health monitoring, and domain-specific reasoning without physical interaction. What makes these agents truly domain-specialists, rather than general-purpose automation, is that each is built to reason and act within the full context of its function. This specialization is what drives superhuman capability: each agent has instantaneous access to all relevant domain knowledge, reasons forward about consequences rather than pattern-matching to past experience, and operates without the fatigue, bandwidth limits, or the serialization bottlenecks that force human experts to address one problem at a time. This is the equivalent of having a fully staffed team of best-in-class specialists across every factory function, available simultaneously and continuously.
- Agent orchestration and coordination to deliver optimized performance at the system level: Delivering optimal factory performance requires activating the right agent at the right time, supplying it with the right information, and reacting rapidly to its outputs, while continuously accounting for planned downtime, preventive maintenance, and contingency situations. But effective orchestration must also respect a fundamental difference in how physical and digital agents operate. Digital agents reason and act within the same layer: their outputs are decisions, recommendations, and plans that carry no immediate physical consequence. Physical agents, by contrast, operate across two coupled layers: a reasoning layer that predicts the right behavior given the current state and objective, and a fast execution layer that carries out that behavior through precise real-time control. Orchestration must bridge these two layers knowing when to intervene at the reasoning level, when to let execution proceed autonomously, and how to reconcile physical execution state with digital planning recommendations. FSI provides a unified framework for multi-agent, multi-timescale coordination, ensuring that domain-specialist agents collaborate coherently rather than optimize in isolation. This is the equivalent of having an effective management and organizational structure that understands not just what needs to be done, but at what level and on what timescale to act.
- Architecture to support continuous learning and adaptation: A factory is not a static environment. Parts and materials may change, tooling wears, and new processes are introduced. We need the underlying AI to improve continuously in the face of this variability, not by simply retraining on new data, but through a more principled mechanism rooted in how agents handle uncertainty. When a domain-specialist agent encounters a situation where its current models cannot confidently predict outcomes, it triggers a structured learning cycle: generating hypotheses about what is unknown, safely exploring those hypotheses in simulation, and validating the most promising ones through carefully designed real-world trials. Observations from those trials flow back to refine the underlying models, which in turn improve simulation fidelity, which enables smarter hypotheses in the next cycle.This continuous Real-to-Sim-to-Real loop is what is needed to achieve adaptation in response to new requirements. New agents can be synthesized as new processes are introduced, and existing agents grow more capable over time as their models become more accurate and complete. This is the equivalent of having a factory culture that does not merely react to change, but systematically learns from every operation to raise the ceiling of what is possible.
Authors
-
Dr. Satyandra K. Gupta is Co-Founder and Chief Scientist at GrayMatter Robotics, where he leads the company's foundational research in physical AI, computational decision-making, and human-centered robotics. He holds the Smith International Professorship in the Viterbi School of Engineering at the University of Southern California and serves as the founding director of the USC Viterbi Center for Advanced Manufacturing. Dr. Gupta previously served as Program Director for the National Robotics Initiative at the National Science Foundation (2012–2014). He has authored more than 500 technical articles and delivered over 200 invited talks worldwide. He is a Fellow of AAAS, ASME, IEEE, NAI, SME, and the Solid Modeling Association. He serves on the Technical Advisory Committee for the Advanced Robotics for Manufacturing (ARM) Institute and a member of the Association for Advancing Automation (A3) Robotics Technology Strategy Board. In the past he served on the National Materials and Manufacturing Board. He is a former Editor-in-Chief of the ASME Journal of Computing and Information Science in Engineering. His research has been covered by the Economist, Forbes, LA Times, IEEE Spectrum, Smithsonian Magazine, and numerous other leading publications.
-
Dr. Omey Manyar is a Lead Robotics Engineer at GrayMatter Robotics, where he works on Physical-AI systems that help robots operate autonomously in complex, real-world manufacturing environments. He holds a Ph.D. in Robotics from the University of Southern California, where his research explored how robots can learn to handle deformable objects using physics-informed learning. Along the way, he has had the privilege of working with teams at Toyota Research Institute, Amazon Robotics, and Rolls-Royce in Singapore. His research has been published at venues including ICRA, IROS, and ASME, and has been recognized with multiple Best Paper Awards. Personal Website: https://omey-manyar.com/