Groq Is The Second AI Startup To Debut AI Chip In The Cloud, Used By Nimbix

Evan Federowicz • Jan 27, 2020 at 04:43am EST

Groq's tensor streaming processor (TSP) silicon is now available to accelerate customer's AI workloads in the cloud. This cloud service provided by Nimbix utilizes the Groq hardware as an on-demand service for "selected customers" only. Groq now joins Graphcore as the only two cloud service providers with accelerators commercially available for customers to use.

Groq's TSP silicon is now utilized in Nimbix's machine learning acceleration on-demand service for "selected customers" only.

Nimbix's CEO, Steve Hebert, stated: "Groq's simplified processing architecture is unique, providing unprecedented, deterministic performance for compute-intensive workloads, and is an exciting addition to our cloud-based AI and Deep Learning platform."

Groq's TSP chip is capable of an enormous 1,000 TOPS ( 1 Peta operations per second), this chip also launched last fall. Groq recently published results show how the chips can achieve 21,700 inferences per second for ResNet-50 v2 inference. According to Groq, this more than doubles the performance of GPU-based systems. The results posted by Groq shows that their architecture is one of the fastest and possibly the fastest commercially available neural network processor.

Jonathan Ross, Groq's co-founder, and CEO stated: "These ResNet-50 results are a validation that Groq's unique architecture and approach to machine learning acceleration delivers substantially faster inference performance than our competitors." He also stated, "These real-world proof points, based on industry-standard benchmarks and not simulations or hardware emulation, confirm the measurable performance gains for machine learning and artificial intelligence applications made possible by Groq's technologies."

One key feature is that Groq's performance advantage doesn't rely on batching. Batching is a common technique in the data center where multiple data samples are processed at a time to improve throughput. According to Groq, its architecture can reach peak performance even at batch = 1. A common requirement for inference applications that may be working on a stream of data arriving in real-time. While the new TSP chip offers a 2.5x latency advantage over GPUs at large batch sizes at batch = 1, Groq has stated that the actual latency advantage is closer to 17x.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Read all comments on Groq Is The Second AI Startup To Debut AI Chip In The Cloud, Used By Nimbix

Groq Is The Second AI Startup To Debut AI Chip In The Cloud, Used By Nimbix

Groq's TSP silicon is now utilized in Nimbix's machine learning acceleration on-demand service for "selected customers" only.

Trending Stories

Square Enix Shareholder Derails 46th Meeting to Praise The Adventures of Elliot, As Publisher Hints At Future Of Final Fantasy

Sony Fans Rally 30,000 Signatures to Save PlayStation Discs, but Factory Retooling Already Signals a Lost Fight

Steam Machine User Faces First Case of Red Line of Death “RLOD” On His Unit Just 20 Minutes In, Indicating a GPU Failure

Apple’s Dual-Foundry Approach To Reach Fruition Much Faster Than Anticipated, As Company Plans To Adopt Intel’s 18A Process For The Base iPhone 18’s A20

Bloober Team Ditches Cronos Survival Dread for Aggressive Combat as Lazarus Lands on PC, PS5, Xbox Series, and Switch 2

Popular Discussions

AMD Zen 6 Gains a New Low-Power Core Beyond Zen 6 and Zen 6C, Surfacing in Linux Kernel Patches

Intel Expected To Restart Supply Of 10th, 12th, 13th, And 14th Gen Processors In Mainland China

RTX 5090 Arrives at Repair Shop With Its 16-Pin Connector Blown to Smithereens, Killing the GPU and VRAM

PlayStation 6 Bill of Materials Is Now Very Close to the Dreaded $1,000 Line, But a Delay Still Isn’t Likely

Sony Just Killed the Disc for PlayStation 6, and Microsoft’s “Project Helix” Xbox Is Reportedly Following

Groq Is The Second AI Startup To Debut AI Chip In The Cloud, Used By Nimbix

Groq's TSP silicon is now utilized in Nimbix's machine learning acceleration on-demand service for "selected customers" only.

Related Story JEDEC Approves SPHBM4 to Break HBM’s Costly Packaging Bottleneck, Retaining HBM4-level Speeds With Standard Packages

Further Reading

Trending Stories

Popular Discussions