NVIDIA unveils next-gen GPU platform focused on AI inference efficiency
The new architecture targets lower latency and lower energy per token, with optimized transformer kernels and memory pipelines for LLMs.
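For context on the metric itself, energy per token is simply sustained power divided by decode throughput. A back-of-envelope sketch follows; the numbers are illustrative assumptions, not vendor figures:

```python
# Illustrative assumptions only, not vendor specs: they show how
# "energy per token" falls out of power and throughput.
board_power_w = 700.0        # assumed sustained board power (watts = J/s)
throughput_tok_s = 10_000.0  # assumed decode throughput (tokens/second)

# energy per token (joules) = power (J/s) / throughput (tokens/s)
joules_per_token = board_power_w / throughput_tok_s
print(f"{joules_per_token * 1000:.1f} mJ per token")  # 70.0 mJ per token
```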
Updates make it easier to blend text, image, and structured tools in a single flow, with expanded governance controls.
New controls for safety, cost attribution, and enterprise connectors reduce friction for production GenAI deployments.
Deeper plugin model and orchestration features help teams turn internal systems into natural-language copilots.
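The announcement gives no API details, but the general pattern behind such plugin models can be sketched as follows; every name here is invented for illustration:

```python
# Hypothetical sketch of the plugin pattern: an internal system (here a
# fake ticket lookup) is wrapped as a tool with a schema that a
# natural-language orchestrator can select and invoke. Names are invented.

def lookup_ticket(ticket_id: str) -> dict:
    """Stand-in for a call into an internal ticketing system."""
    return {"id": ticket_id, "status": "open", "assignee": "dana"}

TOOLS = {
    "lookup_ticket": {
        "description": "Fetch the current status of a support ticket.",
        "parameters": {"ticket_id": {"type": "string"}},
        "handler": lookup_ticket,
    },
}

def dispatch(tool_name: str, arguments: dict) -> dict:
    # The orchestrator chooses a tool name and arguments from the user's
    # request; dispatch simply routes to the registered handler.
    return TOOLS[tool_name]["handler"](**arguments)

print(dispatch("lookup_ticket", {"ticket_id": "T-1234"}))
```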
Faster hybrid search and guardrails improve retrieval quality for RAG while reducing infrastructure overhead.
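One common way hybrid search is realized (not necessarily this vendor's method) is reciprocal rank fusion over a keyword ranking and a vector ranking. A minimal sketch, assuming both ranked lists are already computed:

```python
from collections import defaultdict

def rrf_fuse(keyword_ranked, vector_ranked, k=60):
    """Reciprocal rank fusion: merge two ranked lists of doc IDs.

    k=60 is the constant from the original RRF paper; a larger k
    flattens the contribution of top-ranked documents.
    """
    scores = defaultdict(float)
    for ranking in (keyword_ranked, vector_ranked):
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical ranked results from a BM25 pass and an embedding pass.
print(rrf_fuse(["d1", "d2", "d3"], ["d3", "d1", "d4"]))
# ['d1', 'd3', 'd2', 'd4']
```

RRF needs only ranks, not raw scores, which is why it is a popular fusion choice: BM25 scores and cosine similarities live on incompatible scales and would otherwise need calibration.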
New neural blocks and memory bandwidth targets cut energy cost per inference for private, offline experiences.
Improved evaluations and distribution tooling aim to make open models easier to adopt responsibly.
Build policies, provenance, and software bill of materials (SBOM) integration move into the default developer workflow to reduce supply-chain risk.
Tighter parity between online serving features and offline training features simplifies production ML and lowers operational overhead.
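What online/offline parity means in practice can be shown with a minimal sketch: a single feature function (names illustrative, not from the announcement) serves both the offline training pipeline and the online request path, so the two cannot drift:

```python
# A minimal sketch of online/offline parity: one feature function is
# shared by the offline training pipeline and the online serving path,
# so there is no hand-maintained duplicate to drift apart.

def session_features(raw: dict) -> dict:
    """Single source of truth for the feature logic."""
    return {
        "clicks_per_min": raw["clicks"] / max(raw["minutes"], 1),
        "is_returning": int(raw["visits"] > 1),
    }

# Offline: applied row by row over a historical batch to build training data.
training_rows = [session_features(r) for r in [
    {"clicks": 12, "minutes": 3, "visits": 4},
    {"clicks": 2, "minutes": 5, "visits": 1},
]]

# Online: the exact same function runs on a live request at serving time.
live_features = session_features({"clicks": 7, "minutes": 2, "visits": 2})
print(training_rows, live_features)
```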
Edge caching for embeddings and retrieved chunks reduces tail latency for AI assistants worldwide.
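A minimal sketch of the edge-caching idea, assuming a semantic cache keyed by query embedding; the similarity threshold and the tiny vectors are illustrative, not from the announcement:

```python
import math

# Sketch of a semantic cache for retrieved chunks: reuse a previous
# query's chunks when a new query embedding is close enough, avoiding
# a round trip to the origin retrieval service.

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

class EdgeChunkCache:
    def __init__(self, threshold=0.95):
        self.threshold = threshold  # assumed similarity cutoff
        self.entries = []           # (query_embedding, retrieved_chunks)

    def get(self, query_emb):
        # Linear scan is fine for a small per-edge cache; a real
        # deployment would use an ANN index plus TTL-based eviction.
        for emb, chunks in self.entries:
            if cosine(emb, query_emb) >= self.threshold:
                return chunks  # cache hit: skip the origin round trip
        return None

    def put(self, query_emb, chunks):
        self.entries.append((query_emb, chunks))

cache = EdgeChunkCache()
cache.put([1.0, 0.0], ["chunk-a", "chunk-b"])
print(cache.get([0.99, 0.05]))  # near-duplicate query -> cache hit
```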