In addition to Weibo, there is also WeChat
Please pay attention
WeChat public account
Shulou
2025-09-27 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > IT Information >
Share
Shulou(Shulou.com)11/24 Report--
CTOnews.com November 16 news, Microsoft Ignite 2023 conference has begun today, Nvidia executives attended the meeting and announced the update of TensorRT-LLM, adding support for OpenAI Chat API.
CTOnews.com reported in October that Nvidia launched Tensor RT-LLM open source libraries for data centers and Windows PC. The biggest feature is that if the Windows PC is equipped with Nvidia GeForce RTX GPU,TensorRT-LLM, the LLM can run four times faster on the Windows PC.
At today's Ignite 2023 conference, Nvidia announced an update to TensorRT-LLM, adding Chat API support for OpenAI, and enhanced DirectML capabilities to improve the performance of AI models such as Llama 2 and Stable Diffusion.
TensorRT-LLM can be done locally through Nvidia's AI Workbench, and developers can use this unified, easy-to-use toolkit to quickly create, test, and customize pre-trained generative AI models and LLM on PC or workstations. Nvidia also launched a pre-emptive experience registration page for this purpose.
Nvidia will release an update to TensorRT-LLM 0.6.0 later this month with a fivefold improvement in reasoning performance and support for other mainstream LLM such as Mistral 7B and Nemotron-3 8B.
Users can run on GeForce RTX 30 series and 40 series GPU with more than 8GB video memory, and some portable Windows devices can also use fast and accurate local LLM functions.
Related readings:
"Nvidia launches Tensor RT-LLM to make large language models run four times faster on PC platforms with RTX."
Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.
Views: 0
*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.
The market share of Chrome browser on the desktop has exceeded 70%, and users are complaining about
The world's first 2nm mobile chip: Samsung Exynos 2600 is ready for mass production.According to a r
A US federal judge has ruled that Google can keep its Chrome browser, but it will be prohibited from
Continue with the installation of the previous hadoop.First, install zookooper1. Decompress zookoope
About us Contact us Product review car news thenatureplanet
More Form oMedia: AutoTimes. Bestcoffee. SL News. Jarebook. Coffee Hunters. Sundaily. Modezone. NNB. Coffee. Game News. FrontStreet. GGAMEN
© 2024 shulou.com SLNews company. All rights reserved.