Apple is continuing to expand its AI capabilities at a fairly rapid pace. As a case in point, consider the Cupertino giant's latest AI model, which can create an entire 3D scene from a single 2D image in under a second.
Apple's new view synthesis AI model is lightning fast and fairly accurate
Apple has now published a study, titled "Sharp Monocular View Synthesis in Less Than a Second." The study details how Apple's engineers were able to train an AI model, called SHARP, to generate a "photorealistic" 3D view from a single 2D image.
Critically, Apple claims that the view generation takes "less than a second on a standard GPU via a single feedforward pass through a neural network."
Basically, SHARP predicts a 3D representation of the scene shown in a 2D image, which can then be rendered from viewpoints close to the original camera position.
The study notes:
"The 3D Gaussian representation produced by SHARP can then be rendered in real time, yielding high-resolution photorealistic images for nearby views. The representation is metric, with absolute scale, supporting metric camera movements."
For the benefit of those who might not be aware, 3D Gaussian Splatting is a technique used to create photorealistic 3D scenes by representing them as millions of "splats," which are basically tiny colored blobs. Creating a full scene, however, often requires numerous 2D images from various angles.
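To make the idea of "splats" a little more concrete, here is a minimal Python sketch of how a renderer could blend a few of them at a single pixel. It is a deliberately simplified illustration, not Apple's code: the Splat2D class and shade_pixel function are hypothetical names, and the splats are assumed to be round and already projected onto the image plane, whereas real 3D Gaussian Splatting uses full 3D Gaussians that can be stretched and rotated.

```python
import math
from dataclasses import dataclass


@dataclass
class Splat2D:
    """Hypothetical, simplified splat: a round Gaussian blob on the image plane."""
    x: float        # center x, in pixels
    y: float        # center y, in pixels
    sigma: float    # blob radius (standard deviation), in pixels
    depth: float    # distance from the camera, used only for ordering
    color: tuple    # (r, g, b), each in [0, 1]
    opacity: float  # opacity at the blob's center, in [0, 1]


def shade_pixel(splats, px, py):
    """Blend splats front to back at pixel (px, py) and return (r, g, b)."""
    rgb = [0.0, 0.0, 0.0]
    transmittance = 1.0  # fraction of light not yet blocked by nearer splats
    for s in sorted(splats, key=lambda s: s.depth):
        dist_sq = (px - s.x) ** 2 + (py - s.y) ** 2
        # Opacity falls off with distance from the center as a Gaussian.
        alpha = s.opacity * math.exp(-dist_sq / (2.0 * s.sigma ** 2))
        for i in range(3):
            rgb[i] += transmittance * alpha * s.color[i]
        transmittance *= 1.0 - alpha
    return tuple(rgb)


# A half-transparent red blob in front of an opaque blue one.
scene = [
    Splat2D(x=0.0, y=0.0, sigma=2.0, depth=2.0, color=(0.0, 0.0, 1.0), opacity=1.0),
    Splat2D(x=0.0, y=0.0, sigma=2.0, depth=1.0, color=(1.0, 0.0, 0.0), opacity=0.5),
]
center_color = shade_pixel(scene, 0.0, 0.0)
```

At the shared center, the nearer red blob contributes half of the color and lets the other half through to the blue blob behind it. A full renderer repeats this blending for every pixel and every splat that overlaps it.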
Apple's SHARP differs in that it is able to recreate an entire photorealistic scene from a single 2D image by predicting depth and colors, and it does so in under a second.
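To see why a depth estimate is what turns a flat image into 3D, consider the standard pinhole camera model, sketched below in Python. This is not a description of SHARP's internals, which the study covers in far more detail. The unproject function and the camera values (fx, fy, cx, cy) are illustrative assumptions; the sketch only shows how a pixel plus a depth in meters maps to a 3D point in meters, which is what makes a scene "metric."

```python
def unproject(u, v, depth_m, fx, fy, cx, cy):
    """Pinhole back-projection: map pixel (u, v) at a given depth (meters)
    to a camera-space point (X, Y, Z) in meters.

    fx, fy are focal lengths in pixels; (cx, cy) is the principal point.
    All camera values here are assumed for illustration.
    """
    x = (u - cx) * depth_m / fx
    y = (v - cy) * depth_m / fy
    return (x, y, depth_m)


# Hypothetical 640x480 camera with a 500-pixel focal length.
FX, FY, CX, CY = 500.0, 500.0, 320.0, 240.0

# The image center lies straight ahead of the camera.
center_point = unproject(320.0, 240.0, 2.0, FX, FY, CX, CY)

# A pixel 250 columns to the right, also 2 m away.
right_point = unproject(570.0, 240.0, 2.0, FX, FY, CX, CY)
```

With these assumed values, the second pixel lands one meter to the right of the camera axis. Each such point, together with its pixel's color, could serve as the center of one colored blob in the sense described above.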
What's more, you can now try Apple's SHARP AI model for free by heading over to the dedicated GitHub page.
