Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

idea: Training Ichigo on Structured Output #126

Open
hahuyhoang411 opened this issue Nov 20, 2024 · 1 comment
Open

idea: Training Ichigo on Structured Output #126

hahuyhoang411 opened this issue Nov 20, 2024 · 1 comment
Assignees
Labels
type: idea Research, data, any new ideas

Comments

@hahuyhoang411
Copy link
Contributor

Problem Statement

Current LLM development is moving toward structured output. It's proved to improve model performance in various tasks. Also when training with structured output, we can explore further into training long context which we haven't trained.

Idea

Reference: https://arxiv.org/pdf/2411.10440

@hahuyhoang411 hahuyhoang411 added the type: idea Research, data, any new ideas label Nov 20, 2024
@hahuyhoang411 hahuyhoang411 self-assigned this Nov 20, 2024
@hahuyhoang411
Copy link
Contributor Author

hahuyhoang411 commented Nov 20, 2024

Paper summary:

Structured responses significantly improve the model’s systematic reasoning ability. To achieve this, they design <SUMMARY>, <CAPTION>, <REASONING>, <CONCLUSION> tags to help the model recognize the current stage of reasoning, and create the LLaVA-o1-100k dataset by using GPT-4o to generate stage-level reasoning.

Inference time scaling: Unlike previous methods like best-of-N search and sentence-level beam search, we propose a novel stage-level beam search method. Specifically, we generate multiple responses for each stage (marked by tags) and select the best one to proceed to the next stage.

@hiento09 hiento09 added this to Menlo Nov 22, 2024
@github-project-automation github-project-automation bot moved this to Investigating in Menlo Nov 22, 2024
@tikikun tikikun moved this from Investigating to In Review in Menlo Nov 25, 2024
@bachvudinh bachvudinh added this to the Ichigo v0.5 milestone Nov 25, 2024
@dan-menlo dan-menlo modified the milestones: Ichigo v0.5, Ichigo v0.6 Nov 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type: idea Research, data, any new ideas
Projects
Status: Eng Review
Development

No branches or pull requests

3 participants