[Feature]: zhipuai相关：GLM-4V支持？ #1691

wl223600 · 2024-04-08T10:43:51Z

Class | 类型

大语言模型

Feature Request | 功能请求

现状

目前GPT Academic已具备完善的智谱AI的glm-4和glm-3-turbo支持，上述模型没有直接解读图片的能力。
根据GLM-4文档显示，glm-4v可以解读图片（~~虽然glm-4v的上下文长度仅有2k~~）。
目前GPT Academic对glm-4v尚无完整支持。

bridge_zhipu.py (Line 77)

    if llm_kwargs["llm_model"] in ["glm-4v"]:
        have_recent_file, image_paths = have_any_recent_upload_image_files(chatbot)
        if not have_recent_file:
            chatbot.append((inputs, "没有检测到任何近期上传的图像文件，请上传jpg格式的图片，此外，请注意拓展名需要小写"))
            yield from update_ui(chatbot=chatbot, history=history, msg="等待图片") # 刷新界面
            return
        if have_recent_file:
            inputs = make_media_input(inputs, image_paths)
            chatbot[-1] = [inputs, ""]
            yield from update_ui(chatbot=chatbot, history=history)

bridge_all.py

（暂无glm-4v相关代码）

com_zhipuglm.py

（似乎也没有呢）

参考

GLM-4模型文档节选（完整文档）

模型编码	描述	上下文长度
glm-4	最新的 GLM-4 、最大支持 128k 上下文、支持 Function Call 、Retreival。	128k tokens
glm-4v	实现了视觉语言特征的深度融合，支持视觉问答、图像字幕、视觉定位、复杂目标检测等各类多模态理解任务。	2k tokens

另：GLM-4V完整文档

The text was updated successfully, but these errors were encountered:

binary-husky · 2024-04-09T16:12:42Z

glm-4v的代码尚未通过测试，这部分我们需要帮助

Menghuan1918 · 2024-04-11T04:20:17Z

#1700

binary-husky added the Need Help From Developers label Apr 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: zhipuai相关：GLM-4V支持？ #1691

[Feature]: zhipuai相关：GLM-4V支持？ #1691

wl223600 commented Apr 8, 2024

binary-husky commented Apr 9, 2024

Menghuan1918 commented Apr 11, 2024

[Feature]: zhipuai相关：GLM-4V支持？ #1691

[Feature]: zhipuai相关：GLM-4V支持？ #1691

Comments

wl223600 commented Apr 8, 2024

Class | 类型

Feature Request | 功能请求

现状

bridge_zhipu.py (Line 77)

bridge_all.py

com_zhipuglm.py

参考

binary-husky commented Apr 9, 2024

Menghuan1918 commented Apr 11, 2024