AI Tech

NVIDIA Inference Breakthrough Enhances Conversational AI from the Cloud to the Edge

TensorRT 8, the eighth version of NVIDIA's AI software, was released today and cuts inference time for language queries in half. This lets developers build the world's best-performing search engines, ad recommendations, and chatbots and deliver them from the cloud to the edge.

The improvements in TensorRT 8 provide record-breaking speed for language applications, executing BERT-Large, one of the world's most commonly used transformer-based models, in 1.2 milliseconds. Previously, companies had to reduce the size of their models, which produced considerably less accurate results. With TensorRT 8, companies can now double or triple their model size to make significant gains in accuracy.

Over the last five years, TensorRT has been downloaded over 2.5 million times by more than 350,000 developers from 27,500 companies across various industries, including healthcare, automotive, finance, and retail. TensorRT applications can be deployed in hyperscale data centers, embedded product platforms, and automotive product platforms.

Sparsity is a new performance approach in NVIDIA Ampere architecture GPUs that improves efficiency and allows developers to accelerate neural networks by minimizing computational operations.
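
As a rough illustration of how sparsity reduces computation, the sketch below prunes a list of weights to a 2:4 pattern, in which two of every four consecutive values are zero. The 2:4 pattern is the one NVIDIA documents for Ampere sparse Tensor Cores; it is not stated in this article, and the function name `prune_2_of_4` and the sample weights are hypothetical. This is a plain-Python sketch of the idea, not TensorRT's implementation.

```python
def prune_2_of_4(weights):
    """Zero the two smallest-magnitude values in each group of four weights."""
    if len(weights) % 4 != 0:
        raise ValueError("length must be a multiple of 4")
    pruned = []
    for start in range(0, len(weights), 4):
        group = weights[start:start + 4]
        # Indices of the two largest-magnitude entries in this group.
        keep = sorted(range(4), key=lambda i: abs(group[i]), reverse=True)[:2]
        pruned.extend(w if i in keep else 0.0 for i, w in enumerate(group))
    return pruned


# Hypothetical weights, chosen only to show the pattern.
weights = [0.9, -0.1, 0.05, -0.7, 0.2, 0.3, -0.4, 0.01]
sparse = prune_2_of_4(weights)
print(sparse)
```

Half of the values in each group become zero, so hardware that understands the pattern can skip the corresponding multiplications. In practice a pruned network is usually fine-tuned afterwards to recover accuracy.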

Quantization-aware training allows developers to utilize trained models to perform inference in INT8 precision without sacrificing accuracy. This substantially reduces compute and storage overhead for efficient Tensor Core inference.
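
Quantization-aware training is commonly implemented by inserting "fake quantization" steps during training, so the model learns weights that tolerate INT8 rounding. The sketch below shows one such step under stated assumptions: symmetric quantization to the range [-127, 127] with a single scale taken from the largest absolute value. The function name `fake_quantize_int8` and the sample values are hypothetical, and the article does not describe which scheme TensorRT 8 uses.

```python
def fake_quantize_int8(values):
    """Quantize floats to symmetric INT8, then dequantize back to floats."""
    max_abs = max(abs(v) for v in values)
    # One scale for the whole list; fall back to 1.0 for an all-zero input.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the INT8 range used here.
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    dequantized = [q * scale for q in quantized]
    return quantized, dequantized, scale


# Hypothetical activations, chosen only for illustration.
values = [0.5, -1.27, 0.013, 1.0]
q, dq, scale = fake_quantize_int8(values)
print(q)
print(dq)
```

The integers in `q` are what INT8 inference would compute with, while `dq` shows the rounded values the network sees during training. Each dequantized value differs from its original by at most half a quantization step.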

Broad Industry Support
Industry leaders have used TensorRT for deep learning inference applications in conversational AI and various other areas.

Hugging Face is an open-source AI leader on which the world's biggest AI service providers in various sectors rely. The company collaborates with NVIDIA to provide ground-breaking AI services that will allow text analysis, neural search, and conversational applications at scale.

GE Healthcare, a global leader in medical technology, diagnostics, and digital solutions, is using TensorRT to help accelerate computer vision applications for ultrasounds, a key tool for disease detection. This allows doctors to provide the highest quality of care through the company's healthcare solutions.

Availability
TensorRT 8 is now generally available and free of charge to members of the NVIDIA Developer program. In addition, the most recent versions of plug-ins, parsers, and examples are open source and available from the TensorRT GitHub repository.

About NVIDIA
The GPU, invented by NVIDIA in 1999, sparked the growth of the PC gaming industry and redefined modern computer graphics, high-performance computing, and artificial intelligence. The company's ground-breaking work in accelerated computing and artificial intelligence is reshaping trillion-dollar sectors, including transportation, healthcare, and manufacturing, and fueling the growth of many others.

Other News
AI Tech

AI and Big Data Expo North America Announces Leading Speaker Lineup

TechEx Events | March 07, 2024

AI and Big Data Expo North America announces new speakers!

SANTA CLARA, CALIFORNIA, UNITED STATES, February 26, 2024 /EINPresswire.com/ -- The AI and Big Data Expo North America, the leading event for Enterprise AI, Machine Learning, Security, Ethical AI, Deep Learning, Data Ecosystems, and NLP, has announced a fresh cohort of distinguished speakers for its upcoming conference at the Santa Clara Convention Center on June 5-6, 2024.

Some of the top industry speakers set to take the stage are:
- Sam Hamilton, Head of Data & AI, Visa
- Dr Astha Purohit, Director, Product (Tech) Ops, Walmart
- Noorddin Taj, Head of Architecture and Design of Intelligent Operations, BP
- Temi Odesanya, Director, AI Governance Automation, Thomson Reuters
- Katie Sanders, Assistant Vice President, Tech, Union Pacific Railroad
- Prasanth Nandanuru, SVP, Wells Fargo
- Rodney Brooks, Professor Emeritus, MIT

These speakers bring a wealth of knowledge and expertise to an already impressive lineup. In addition to the speakers, the AI and Big Data Expo North America will feature a series of presentations covering the latest innovations, implementations, and strategies in AI and Big Data across a range of industries.

Attendees can expect to gain valuable insights and practical strategies from presentations such as:
- How Gen AI Positively Augments Workforce Capabilities
- Trends in Computer Vision: Applications, Datasets, and Models
- Getting to Production-Ready: Challenges and Best Practices for Deploying AI
- Ensuring Your AI is Responsible and Ethical
- Mitigating Bias and Promoting Fairness in AI Systems
- Security Challenges in the Era of Gen AI and Data Science
- AI for Good: Social Impact and Ethics
- Selling Data Democratization to Executives
- Spreading Data Insights across the Business
- Barriers to Overcome: People, Processes, and Technology
- Optimizing the Customer Experience with AI
- Using AI to Drive Growth in a Regulated Industry
- Building an MLOps Foundation for AI at Scale

The Expo showcases how cutting-edge technologies are reshaping industries including manufacturing, transport, supply chain, government, legal sectors, financial services, energy, utilities, insurance, healthcare, retail, and more. Attendees will be able to see firsthand how AI and Big Data are being applied across these sectors.

Anticipating a turnout of over 7,000 attendees and featuring 200 speakers across various tracks, AI and Big Data Expo North America offers an opportunity for CTOs, CDOs, CIOs, Heads of IoT, AI/ML, IT Directors, and tech enthusiasts to stay abreast of the latest trends and innovations in AI, Big Data, and related technologies. Organized by TechEx Events, the conference will also feature six co-located events, including the IoT Tech Expo, Intelligent Automation Conference, Cyber Security & Cloud Congress, Digital Transformation Week, and Edge Computing Expo.

Attendees can choose from various ticket options, providing access to engaging sessions, the expo floor, premium tracks featuring industry leaders, a VIP networking party, and a networking app for making connections ahead of the event. Tickets are available at a 25% discount, a saving of up to $300, until March 31st, 2024. For more information and to secure your place at AI and Big Data Expo North America, please visit https://www.ai-expo.net/northamerica/.

About AI and Big Data Expo North America: The AI and Big Data Expo North America is a leading event in the AI and Big Data landscape, serving as a nexus for professionals, industry experts, and enthusiasts to explore the evolving technological frontier, with a focus on education, networking, and collaboration. AI and Big Data Expo North America is a part of TechEx.