How can I convert `output.jsonl` into a format similar to `SWE-Gym/OpenHands-SFT-Trajectories`? #6830

lycfight · 2025-02-19T14:01:50Z

Additional context

I want to train a model on long dialogue trajectories generated by OpenHands to improve its performance. However, I am confused by the `output.jsonl` file that `run_infer` produces. Some history entries have an odd number of turns, some have an even number, and some turns are empty. This differs significantly from the usual multi-turn dialogue SFT data format.

Which parts of the data can be used to build a proper multi-turn dialogue training set? Could you explain the meaning of each field in this file in detail? Also, how can I convert `output.jsonl` into a format similar to `SWE-Gym/OpenHands-SFT-Trajectories`?
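To make the question concrete, here is a rough sketch of the kind of conversion I have in mind. It is not an official OpenHands or SWE-Gym tool, and every field name in it is an assumption on my side: that each line of `output.jsonl` holds a `history` list of event objects, that each event carries a `source` field (with `"agent"` meaning the model) and its text in `content` or `message`, and that the target format is a `messages` list of `role`/`content` pairs. The helper names (`event_text`, `history_to_messages`, `convert_file`) are made up for illustration. It drops empty turns and merges consecutive turns from the same side so that roles alternate.

```python
import json


def event_text(event):
    """Return the first non-empty text field of an event, or ''.

    Assumption: the text lives under "content" or "message".
    """
    for key in ("content", "message"):
        value = event.get(key)
        if isinstance(value, str) and value.strip():
            return value.strip()
    return ""


def history_to_messages(history):
    """Turn a list of history events into alternating user/assistant messages."""
    messages = []
    for event in history:
        if not isinstance(event, dict):
            continue
        text = event_text(event)
        if not text:
            continue  # skip empty turns
        # Assumption: source == "agent" marks model output; anything else
        # (user, environment) is treated as input to the model.
        role = "assistant" if event.get("source") == "agent" else "user"
        if messages and messages[-1]["role"] == role:
            # Merge consecutive turns from the same side so roles alternate.
            messages[-1]["content"] += "\n\n" + text
        else:
            messages.append({"role": role, "content": text})
    # A trailing non-assistant turn has no target to train on, so drop it.
    while messages and messages[-1]["role"] != "assistant":
        messages.pop()
    return messages


def convert_file(src_path, dst_path):
    """Write one {"messages": [...]} JSON object per input trajectory."""
    with open(src_path, encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            if not line.strip():
                continue
            record = json.loads(line)
            messages = history_to_messages(record.get("history", []))
            if messages:
                dst.write(json.dumps({"messages": messages}, ensure_ascii=False) + "\n")
```

Calling `convert_file("output.jsonl", "sft.jsonl")` would then write one conversation per line. I am not sure whether this matches how `SWE-Gym/OpenHands-SFT-Trajectories` was actually built, for example whether tool calls and observations need special formatting, which is part of what I am asking.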

lycfight added the `enhancement` (New feature or request) label on Feb 19, 2025

mamoodi added the `evaluation` (Related to running evaluations with OpenHands) and `troubleshooting/help` (User requires help) labels and removed the `enhancement` (New feature or request) label on Feb 19, 2025
