Code for the Paper "Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning for Robotics"
Updated Jun 30, 2022 - Python
Instruction Following Agents with Multimodal Transformers
Collect and maintain high-quality instruction fine-tuning datasets across different domains and languages.
The repo collects model and data projects for instruction following large language models.
The home of Stambecco 🦌: Italian Instruction-following LLaMA Model
Finetune LLaMA-7B with Chinese instruction datasets
🌱 DreamerGPT (梦想家, "Dreamer"): instruction fine-tuning for Chinese large language models
Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction, such as text, speech, images, and videos, to provide a seamless and versatile user experience.
A reading list on instruction tuning, a trend that started with Natural-Instructions (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).
A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)
A benchmark for evaluating the capabilities of large vision-language models (LVLMs)
WangChanGLM 🐘 - The Multilingual Instruction-Following Model
This is the official repo for Contrastive Vision-Language Alignment Makes Efficient Instruction Learner.
A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Following" (ICCV 2021). We address the task of long horizon instruction following with a modular architecture that decouples a task into visual perception and action policy prediction.
Code for "Lion: Adversarial Distillation of Proprietary Large Language Models (EMNLP 2023)"
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
Code and documentation to train Stanford's Alpaca models, and generate the data.
A collection of ChatGPT and GPT-3.5 instruction-based prompts for generating and classifying text.
PhoGPT: Generative Pre-training for Vietnamese (2023)