[feat] Request response refusal validator #1189

railsstudent · 2024-12-09T09:23:54Z

Description
Request a validator that determines whether or not a LLM refuses a prompt and generates an output that starts with texts such as "I cannot", "I can't", and "It is illegal"

Why is this needed
If the response is refused, it should not be returned to the client application to display. The validator should throw a validation error that the application should handle appropriately.

Implementation details
I suppose Huggingface provides models for response refusal checking.

End result
After a LLM generate a text, the validator is used to validate whether the response is refused by having but not limited to texts such as "I can not", "I can't", "It is not legal", etc.

JosephCatrambone · 2024-12-09T18:39:47Z

Ooh. That's a neat idea. It's not on the current sprint listing but I'd like to take a swing at it.

railsstudent added the enhancement New feature or request label Dec 9, 2024

zsimjee assigned JosephCatrambone Dec 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[feat] Request response refusal validator #1189

[feat] Request response refusal validator #1189

railsstudent commented Dec 9, 2024

JosephCatrambone commented Dec 9, 2024

[feat] Request response refusal validator #1189

[feat] Request response refusal validator #1189

Comments

railsstudent commented Dec 9, 2024

JosephCatrambone commented Dec 9, 2024