Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

新手发问:SFT只有部分assistant的内容计算loss,该如何实现 #6390

Closed
ReycoLi opened this issue Dec 19, 2024 Discussed in #6389 · 0 comments
Closed

新手发问:SFT只有部分assistant的内容计算loss,该如何实现 #6390

ReycoLi opened this issue Dec 19, 2024 Discussed in #6389 · 0 comments
Labels
duplicate This issue or pull request already exists

Comments

@ReycoLi
Copy link

ReycoLi commented Dec 19, 2024

Discussed in #6389

Originally posted by ReycoLi December 19, 2024
数据集的格式如下

"messages": [
{
"role": "system",
"content": "agent职责..."
},
{
"role": "user",
"content": "用户实际问题"
},
{
"role": "assistant",
"content": "Thought:...\n Action: ...\n Observation:..."
}
]

assistant回复中只有Thought和Action部分去计算loss,Observation部分不参与loss计算。
该如何实现这个功能,基于llama factory框架的最佳实践应该是什么,修改哪部分的代码。

求大神指点。

@github-actions github-actions bot added the pending This problem is yet to be addressed label Dec 19, 2024
@hiyouga hiyouga closed this as not planned Won't fix, can't repro, duplicate, stale Dec 19, 2024
@hiyouga hiyouga added duplicate This issue or pull request already exists and removed pending This problem is yet to be addressed labels Dec 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
duplicate This issue or pull request already exists
Projects
None yet
Development

No branches or pull requests

2 participants