← Sony

H1 2026

Sony

Filed Patents 101 patents

Overview

Sony filed 101 patents in H1 2026 across Audio (17), UI/UX (4), AI & Machine Learning (41), Hardware (8), Game Engines (3), Platforms (7), Graphics (5), VR & AR (5), Networking (1), Cloud Gaming (5), Monetization (2), and Streaming (3).

The AI & Machine Learning category covers machine learning applications ranging from automatic highlight generation and personalized tutorial systems to NPC dialogue that responds to player speech and LLM-powered development tools that generate game code. Audio patents describe adaptive technologies including spatial positioning systems, hearing accessibility features that adjust sounds based on Audiograms, and wireless speaker setups that mix game Audio with chat and music simultaneously. Hardware, Graphics, VR & AR, and Cloud Gaming patents detail controllers with swappable components and adaptive touch surfaces, ray-tracing optimizations using spectral rendering, motion sickness detection for headsets, and edge computing solutions that buffer streams at cell towers to reduce latency.

Technology Themes

The 17 Audio patents span a wide range of problems, from physical speaker placement to personalized hearing compensation to voice chat management. Several of them tackle the challenge of hearing loss directly: one system applies individualized Audiogram data alongside real-time biometric signals like EEG and heart rate to continuously update a user's hearing profile and selectively adjust specific sound categories such as explosions or shouting, while 2 companion patents describe efficient methods for executing that adjustment in real time, one using pre-stored filter combinations to approximate a dynamic boost curve without building custom filters from scratch, and another applying loudness-dependent gain per frequency band so that a quiet sound and a loud sound in the same range receive different treatments. A separate spatial Audio patent lets players manually position secondary Audio sources like chat or music in a 3D listening environment, with an AI layer that steps in to prevent those sources from colliding with game Audio, while another inserts local commentary into the natural silent gaps of chat channels and places it directionally to avoid overlap. On the speaker Hardware side, 3 patents describe wireless speaker systems that receive game Audio and phone Audio simultaneously over separate radios, mix them with per-channel equalization, and use stereo echo cancellation (rather than single-source cancellation) to isolate microphone input cleanly while multiple Audio streams play. For music integration, 2 patents use AI to match a player's personal playlist to gameplay intensity in real time, one by detecting the tempo and energy of in-game events and syncing music accordingly, and the other by attribute-matching the user's songs to the stylistic characteristics of the original game score so the two blend naturally moment to moment. Additional patents cover adaptive speaker positioning that uses microphones on speakers, controllers, and cameras to triangulate a listener's location and shift the stereo sweet spot to follow them around the room; a voice chat system that modifies Audio quality based on in-game relationships and distances between players; a text-to-speech system that gives game developers phoneme-level control over character voice pitch without requiring re-recording; and an AI prioritization system that monitors which game moments and communications are most important and automatically adjusts competing Audio streams to keep them audible.

4 UI/UX patents address how players interact with games and each other at the interface level. One system watches for patterns of repeated input failure, identifies the sections of a game where a player consistently struggles, and automatically generates personalized practice scenarios that reflect actual difficulty without spoiling story content, displaying controller inputs visually in real time so players can build muscle memory. Another patent describes a handoff system for passing game control between players, combining slow-motion or looping gameplay with visual button highlights, haptic pulses, and Audio cues so the incoming player can orient themselves before taking over. A VR-focused patent monitors how absorbed a player is in a session and displays that status visually to people nearby, while queuing incoming messages and holding them until a lower-intensity moment makes delivery appropriate. The fourth patent in this category removes ambiguity from NPC interactions in crowded environments by combining gaze direction and proximity data to determine which character a player actually intends to interact with, eliminating accidental triggering of the wrong conversation.

The 41 AI/ML patents cover a broad range of applications across gameplay, content creation, accessibility, and social features. Several patents focus on in-game assistance delivered through natural language: one routes real-world notifications like texts and emails through NPC dialogue at contextually appropriate moments rather than overlaying a generic alert, another lets players speak directly to in-game characters to receive story recaps, navigation help, and tips delivered in the character's own voice, and a third builds a full virtual teammate powered by a large language model that monitors health, tracks enemies, and offers tactical advice during single-player sessions. Related to NPC behavior, one patent creates a bidirectional loop between a game engine and a language model so that NPC speech stays synchronized with what actually occurred in-game, while another trains NPCs on real player chat data so their dialogue reflects genuine conversational style and expires automatically if the gameplay moment that prompted it has already passed. Difficulty and player skill receive attention across multiple patents: one system profiles a player's behavioral style and constructs challenges specifically designed to push against their habits rather than scaling difficulty linearly, and another combines predictive skill modeling with haptic guidance through controller vibrations to help struggling players without simply lowering the difficulty setting. A cluster of patents addresses personalized recaps and tutorials, including a system that analyzes why a player quit (distinguishing boredom from difficulty from external factors) and constructs an appropriate refresher on return, and another that curates a "previously on" style video summary from saved game state data with interactive move tutorials embedded inside it. On the content generation side, patents describe an AI pipeline that converts gameplay session data into storybooks, highlight reels, podcasts, and memes; a system that generates smooth video transitions between gameplay moments using a generator-discriminator feedback loop; a 3D asset generation system that uses NeRF-to-mesh conversion to produce personalized in-game items in under 2 minutes; and a game content generation tool where designer-authored keyframes constrain what a generative model is allowed to produce so output stays stylistically consistent. Social and broadcast features appear across several patents: one uses eye-tracking to detect when 2 players are looking at each other in a multiplayer game and automatically suggests or initiates a chat channel between them, another automates the full Esports broadcast pipeline by combining AI-driven camera direction with dynamic narration trained on spectator attention data, and a crowd-participation system uses cameras pointed at stadium audiences to translate mass body movements into collective game inputs without requiring personal devices. Additional patents cover an ML pipeline that fuses camera, accelerometer, and microphone data to recognize player intent without a traditional controller; a system that converts player motion descriptions to text and challenges remote players to mimic them; AI-driven trophy personalization that generates unique achievement presentations based on how and when a player earned them; a gameplay analysis tool that reads video clips, view counts, and social comments to produce structured quality reports with the option to automatically adjust game parameters based on findings; a machine learning system that watches gameplay video and reverse-engineers which controller buttons were pressed to create annotated learning content; a route-based storytelling system for passengers in cars that feeds live navigation data into a language model to generate location-aware interactive narratives displayed on the vehicle's windows; a single training framework for NPC agents that lets designers tune personality style at inference time by adjusting reward weight sliders without retraining; a broadcasting assistance system that builds personalized gaming podcasts narrated in the voices of characters from a player's own history; an Audio personalization system that learns individual sound preferences across sessions and modifies game Audio in real time through game engine integration; an AI model that strips games down to essential story moments for players with limited time; an automated accessibility narration system that generates Audio descriptions and inserts them into natural dialogue gaps; a system that watches gameplay footage and creates 3D overlays of popular player paths using crowd-sourced behavioral data; a closed-loop QA pipeline that detects performance issues, diagnoses them with ML, attempts fixes, and only escalates to human reviewers when automation fails; a multi-modal ML system that fuses visual and inertial data to generate personalized video game recaps by combining existing help content; an in-game voice command system that lets players direct AI teammates, monitor squad status, and issue strategic instructions using natural speech; and finally a context-aware assistant that passively monitors video frames during gameplay and proactively generates answers to questions the player has not yet asked.

Eight Hardware patents describe physical devices and engineering approaches for controllers and other gaming peripherals. 3 of them share a focus on arcade-style fight stick controllers: one uses microswitches and a mechanical coupling system for precise digital directional input, another replaces switches with magnetic sensors to provide true analog input through a full-size arcade lever, and a third adds a user-accessible swappable gate system with onboard storage for multiple gate shapes so players can switch joystick movement patterns for different game types without tools. A fourth Hardware patent addresses a more general ergonomics problem by introducing an angled button pad set at an oblique angle relative to the controller body to replicate arcade cabinet button positioning on a modern console controller. Separate from joystick design, one patent describes a controller with a selective button-locking mechanism that disables system buttons like the home button during competitive play without affecting any action inputs. Moving away from traditional input devices entirely, one patent describes an optical sensor panel that detects pretouch gestures and renders software-defined input regions on a flat surface, letting a single panel emulate any button layout and adapt to individual users without physical modification. The remaining 2 patents involve non-controller Hardware: one creates a bidirectional API bridge between gaming software and a parked vehicle's native systems so that in-game events can physically actuate components like headlights, horns, and seats, and another uses a coordinated swarm of drones as a repositionable aerial projection surface for displaying game video outdoors.

The 3 game engine patents describe tools intended to support the development and runtime management of games. One applies large language models to a multi-file-aware development environment where the AI can generate, edit, and cross-reference multiple code files simultaneously within a unified interface that includes version history and inline testing, going beyond single-file chat-based code assistants. A related patent uses LLMs to produce game schemas that maintain dependency relationships between game objects and then generates executable code that correctly instantiates those objects with their connections intact. The third patent in this group addresses runtime interruption handling by replacing binary pause and resume states with a spectrum of gameplay intensity levels, so that when a player needs to deal with a communication or other interruption, the game reduces intensity gradually rather than stopping entirely, keeping multiplayer sessions from halting abruptly.

Platform-level capabilities across 7 patents cover authentication, social features, and cross-environment data integration. One patent turns the game controller into an authentication node that detects when a paired phone is nearby and automatically logs the player in without requiring password entry on the console. A friend-matching patent uses probabilistic analysis of past play session timing to surface recommendations for players whose schedules and gaming habits overlap, rather than relying on existing friend lists or manual searches. 2 patents deal with physical collectibles embedded with codes: one uses prior digital gameplay history to prompt meaningful in-person meetups when players who have gamed together online are in physical proximity, and the other creates a dynamic link where real-world engagement with the collectible modifies in-game parameters rather than simply unlocking static content. A patent connecting in-car gaming to home console experiences allows routes, images, and achievements gathered during vehicle travel to be imported and used to procedurally modify home game content, creating continuity between physical travel and in-game progression. An always-recording DVR buffer patent gives players instant access to recent gameplay through a controller button, letting them review missed NPC dialogue or forgotten objectives without exiting the session. The final Platforms patent uses machine learning to automatically detect game-specific event structures in third-party game code and generate standardized telemetry instrumentation without requiring developers to write that instrumentation themselves.

Five Graphics patents address rendering efficiency and visual quality. Sony's spectral rendering patent reduces the computational cost of ray tracing by distributing wavelength sampling across neighboring pixels and successive frames rather than computing multiple wavelengths per pixel per frame, selecting which wavelengths to sample based on material properties and lighting conditions. A second patent addresses AI upscaling models that run alongside the Graphics pipeline and dynamically switches between pre-cached neural networks of varying precision levels when performance monitoring detects that a scene is pushing Hardware load, maintaining frame rate without visibly reducing resolution. A third patent combines traditional mesh-based rendering with a neural radiance approach embedded directly in the fragment shader, letting the mesh handle geometry while the neural network handles surface detail, achieving photorealistic output at real-time speeds without requiring a full NeRF computation. Texture Streaming efficiency receives attention in a patent that performs a lightweight partial render first to identify which textures are actually visible in the current scene and at what mipmap level before requesting data, avoiding the wasteful transfer of texture data that will never be displayed. The fifth patent uses Gaussian splatting to generate a volumetric 3D representation from game video and metadata, allowing user annotations like coaching markup to be inserted into the scene with correct depth ordering and occlusion rather than appearing as flat 2D overlays.

The 5 VR/AR patents span headset ergonomics, calibration, face reconstruction, and privacy. One patent monitors headset motion sensor data continuously during VR sessions and uses ML to detect early postural stability changes that precede motion sickness, allowing the system to warn players and adjust display settings before symptoms appear rather than responding after the fact. A gesture calibration patent shifts hand-tracking calibration from a one-time pre-game process to an ongoing one that uses actual gameplay interactions to refine recognition models, evaluating accuracy based on game outcomes and updating without requiring players to exit or pause. A resource allocation patent applies gaze and gesture tracking in XR headsets to direct image processing power toward where the player is looking and interacting, reducing compute load on areas outside that focus. For multiplayer social presence, one patent uses recurrent neural networks to reconstruct the parts of a player's face hidden behind a headset in real time, so that other participants in a session can see a complete facial representation during interaction. The fifth patent addresses privacy by filtering out the voice-caused motion components captured by HMD sensors, preventing speech from being reconstructed from headset accelerometer data while preserving the head tracking data needed for the experience.

A single Networking patent describes a hybrid server architecture for multiplayer gaming in which game servers are provisioned dynamically from the pool of players' own consoles, removing the need to choose between centralized developer-maintained servers and manually configured community-run alternatives. The system handles allocation automatically without requiring players to have any technical knowledge or take any deliberate action.

The 5 cloud gaming patents address Streaming reliability, latency, and resource allocation. One patent deploys buffering proxies at 5G base stations that understand the structure of game streams well enough to abandon retransmission of frames whose display window has already passed and to pre-activate proxies at adjacent towers before a handoff occurs, maintaining continuity during movement between cells. When packets do drop, a complementary patent generates predicted next frames locally on the client device before they are needed, filling gaps in the stream without waiting for the server to retransmit anything. A predictive asset Streaming patent analyzes game state to determine what content the player is likely to encounter next and pre-loads only those assets at bandwidth-appropriate quality levels, avoiding both upfront bulk loading and reactive on-demand fetching. A cloud-based capture system aggregates player failure data across sessions, uses that data to trigger targeted video capture from client machines for remote diagnostics, and routes the same infrastructure toward highlight reel generation, serving both QA and content creation purposes simultaneously. The fifth patent enables split-screen multiplayer where one player's instance runs locally on a resource-constrained device while the other streams from a cloud server or remote console, with latency compensation and adaptive quality settings keeping both sides synchronized.

Two Monetization patents address the same opportunity from slightly different angles. Both describe systems that detect when a game enters a loading or wait state and use that window to serve alternate content such as advertisements, trailers, or playable mini-games, treating the interruption as a programmable content slot rather than dead time. The distinction between the 2 lies primarily in implementation detail: one frames the delivery mechanism as a Platforms-level injection system that operates without requiring the game developer to integrate it, while the other describes a game-aware pipeline that selects contextually relevant content matched to the specific type of wait state detected.

Three Streaming patents center on tools for coaches, spectators, and creators working with game video. 2 of them use Gaussian splatting and game metadata to allow user-drawn annotations to behave as 3D objects within footage rather than flat overlays, so that paths, arrows, and drawings automatically adjust their position, speed, and visibility based on scene depth and game physics. The distinction between these 2 is emphasis: one focuses on the coaching use case and how annotations animate with the physical properties of the game world, while the other describes the broader technical pipeline for extracting metadata and mapping user content to volumetric scene geometry. The third Streaming patent takes a different approach entirely, using AI to reconstruct a full 3D representation from a 2D game stream so that spectators can view the same gameplay from camera angles of their own choosing, independent of what the player's camera shows.

Patent Sources (101)

All data sourced from USPTO patent filings. Google Patents may take several weeks to index recent publications. If a link is unavailable, search for the patent number at USPTO Patent Public Search.

All Sony patents → All companies → Database coverage →