Zhipu AI Announces the Free Release of the GLM-4-Flash Large Model
On August 27, Zhipu AI announced that the GLM-4-Flash large model is now freely available for use through the Zhipu AI large model open platform.
GLM-4-Flash is suited to simple, cost-sensitive vertical tasks that require fast responses. It achieves a generation speed of 72.14 tokens/s, roughly equivalent to 115 characters/s.
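The two quoted speeds imply an average of roughly 1.6 characters per generated token. A quick check (the throughput figures are the officially reported ones; the per-token ratio is derived here, not an official number):

```python
# Officially reported generation speed of GLM-4-Flash.
tokens_per_second = 72.14
chars_per_second = 115  # approximate character-level equivalent

# Derived average characters per token (not an official figure).
chars_per_token = chars_per_second / tokens_per_second
print(f"{chars_per_token:.2f} characters per token")  # ≈ 1.59
```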
GLM-4-Flash supports multi-turn dialogue, web browsing, function calling (Function Call), and long-text reasoning with a maximum context of 128K, and handles 26 languages including Chinese, English, Japanese, Korean, and German.
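As a rough illustration of multi-turn use, the sketch below assembles a chat request carrying prior turns plus a new user message. The `glm-4-flash` model name follows the announcement; the endpoint details are not shown, and the message schema assumes a conventional OpenAI-style chat-completions format, which may differ from the platform's actual API:

```python
import json

def build_chat_request(history, user_message, model="glm-4-flash"):
    """Assemble a multi-turn chat request: prior turns plus one new user turn.

    Hypothetical helper; the {"role", "content"} message schema is an
    assumption modeled on common chat-completions APIs.
    """
    messages = list(history) + [{"role": "user", "content": user_message}]
    return {"model": model, "messages": messages}

history = [
    {"role": "user", "content": "What does a 128K context mean?"},
    {"role": "assistant", "content": "It is the maximum number of tokens the model can attend to at once."},
]
request = build_chat_request(history, "Give an example use case.")
print(json.dumps(request, ensure_ascii=False, indent=2))
```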
According to Zhipu AI, adaptive weight quantization, multiple parallelization methods, batching strategies, and speculative sampling reduce latency and increase generation speed at the inference level. The resulting gains in concurrency and throughput not only improve efficiency but also significantly lower inference costs, which is what allows the model to be offered for free.
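Of the techniques listed, speculative sampling is the least self-explanatory. The toy sketch below shows the simplest greedy variant of the idea: a small, fast draft model proposes several tokens at once, the large target model verifies them, and the longest agreeing prefix is accepted, so several tokens can be emitted per expensive target step while the output stays identical to decoding with the target alone. Both "models" here are made-up deterministic functions, not GLM-4-Flash, and this is not Zhipu AI's actual implementation:

```python
def target_next(prefix):
    # Stand-in for the large target model: deterministic next-token rule.
    return (sum(prefix) + 1) % 7

def draft_next(prefix):
    # Stand-in for the small draft model: agrees with the target most of
    # the time, but deliberately misses on every fourth position.
    guess = (sum(prefix) + 1) % 7
    return guess if len(prefix) % 4 else (guess + 1) % 7

def speculative_step(prefix, k=4):
    """Propose k draft tokens, then keep the longest prefix the target agrees with."""
    # Draft phase: cheap model proposes k tokens autoregressively.
    proposal, ctx = [], list(prefix)
    for _ in range(k):
        t = draft_next(ctx)
        proposal.append(t)
        ctx.append(t)
    # Verify phase: target checks each proposed token in order.
    accepted, ctx = [], list(prefix)
    for t in proposal:
        if target_next(ctx) == t:          # draft token confirmed
            accepted.append(t)
            ctx.append(t)
        else:                              # first mismatch: take the
            accepted.append(target_next(ctx))  # target's token and stop
            break
    else:
        accepted.append(target_next(ctx))  # all accepted: one bonus token
    return accepted

tokens = [0]
while len(tokens) < 12:
    tokens += speculative_step(tokens)
print(tokens)
```

By construction the output matches plain greedy decoding with the target model, which is the key property that makes the speedup "free" in quality terms.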
For pre-training, the team introduced large language models into the data-screening process, yielding 10T of high-quality multilingual data, more than three times the amount used for ChatGLM3-6B; in addition, FP8 was used during pre-training to improve training efficiency and make better use of compute.
SEE ALSO: Zhipu AI Unveils GLM-4 Model with Advanced Performance Paralleling GPT-4