
Bug Report: The model frequently generates repetitive token sequences. #368

Open
Razaghallu786 opened this issue Dec 18, 2024 · 4 comments
Labels
component:examples Issues/PR referencing examples folder status:triaged Issue/PR triaged to the corresponding sub-team type:bug Something isn't working

Comments

@Razaghallu786

Description of the bug:

No response

Actual vs expected behavior:

No response

Any other information you'd like to share?

No response

@Razaghallu786
Author

Bug Report: Repetitive Token Generation in "gemini-1.5-flash" Model

Description of the Bug:
When generating long texts with the "gemini-1.5-flash" model, the output frequently degenerates into repetitive token sequences: the model loops on the same phrase until the output token limit is exhausted. The behavior is reproducible through both the Vertex AI and Gemini APIs.


Example:

"The judgment can be appealed in a motion for reconsideration, claiming that the judge did not consider the evidence properly. The judgment can be appealed in a motion for reconsideration, claiming that the judge did not consider the evidence properly. The judgment can be appealed in a motion for reconsideration, claiming that the judge did not consider the evidence properly. The judgment can be..."

Steps to Reproduce:

  1. Use the "gemini-1.5-flash" model via Vertex or Gemini API.

  2. Generate a long text (e.g., a legal or technical document).

  3. Observe the generated output for repeated phrases or sentences.


Expected Behavior:
The model should produce coherent, non-repetitive text.

Actual Behavior:
The model enters a repetitive loop, generating the same token sequences indefinitely until the token limit is reached.


Impact:

Resource Waste: Tokens are wasted, increasing costs and exhausting API usage limits.

Output Quality: The generated text becomes unusable, requiring additional API requests.


Reproduction Rate:
Occurs frequently when generating long-form text.


Workaround:
There is currently no known workaround to prevent this issue.
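That said, a purely client-side check can at least detect the loop after the fact and truncate the output before it is used downstream. The sketch below is illustrative only (`find_repetition` is a hypothetical helper, not part of any SDK), and it does not stop the tokens from being billed:

```python
def find_repetition(text, min_words=8, min_repeats=3):
    """Scan `text` for a phrase of at least `min_words` words that repeats
    `min_repeats` or more times back to back.

    Returns the word index where the looped phrase starts, or -1 if no
    such loop is found. Thresholds are assumptions; tune them per use case.
    """
    words = text.split()
    n = len(words)
    # Try every candidate phrase length, shortest first.
    for size in range(min_words, n // min_repeats + 1):
        for start in range(n - size * min_repeats + 1):
            chunk = words[start:start + size]
            # Check whether the next (min_repeats - 1) windows are identical.
            if all(words[start + k * size: start + (k + 1) * size] == chunk
                   for k in range(1, min_repeats)):
                return start
    return -1
```

For example, calling `find_repetition(response.text)` and getting a non-negative index would flag looped output like the sample above; the caller could then keep only the words up to that index plus one copy of the phrase.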


Request for Resolution:

  1. Investigate and resolve the cause of repetitive token generation.

  2. Implement a mechanism to detect and avoid repetitive loops during generation.

  3. Consider offering refunds or credits for tokens wasted due to this bug.


Actual vs. Expected Behavior:
Actual Output:

"The judgment can be appealed in a motion for reconsideration, claiming that the judge did not consider the evidence properly. The judgment can be appealed in a motion for reconsideration, claiming that the judge did not consider the evidence properly. The judgment can be appealed..."

Expected Output:
"The judgment can be appealed in a motion for reconsideration, claiming that the judge did not consider the evidence properly."


@gmKeshari gmKeshari added type:bug Something isn't working status:triaged Issue/PR triaged to the corresponding sub-team component:examples Issues/PR referencing examples folder labels Dec 19, 2024
@gmKeshari

Hi @Razaghallu786,

Could you please provide a bit more clarification on this? Does this happen with features such as function calling or structured output, or also when simply running the prompt above?

@Giom-V
Collaborator

Giom-V commented Dec 21, 2024

Which temperature are you using? If you are using 0, can you try with a higher one?
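To make that suggestion concrete, here is a hedged sketch of a non-greedy sampling configuration. The dictionary keys match the `google-generativeai` `GenerationConfig` fields; the specific values are illustrative assumptions, not a confirmed fix for this bug:

```python
# Illustrative only: non-zero temperature plus nucleus sampling sometimes
# breaks deterministic repetition loops. Values here are assumptions.
generation_config = {
    "temperature": 0.7,          # avoid 0, which makes decoding greedy
    "top_p": 0.95,               # nucleus sampling adds diversity
    "max_output_tokens": 2048,   # caps wasted tokens if a loop still occurs
}

# Hypothetical usage with the google-generativeai SDK (requires an API key):
# import google.generativeai as genai
# model = genai.GenerativeModel("gemini-1.5-flash",
#                               generation_config=generation_config)
# response = model.generate_content(prompt)
```

Capping `max_output_tokens` does not prevent the loop, but it bounds the cost of any single runaway request while the root cause is investigated.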


4 participants