long text prompt doesn't work #618
Comments
When increasing the text prompt size beyond a specific limit, you may encounter noisy output or irrelevant content that doesn't directly relate to your original prompt. This can happen due to several factors:
1. Token Limit Exceeded
2. Context Window Saturation
3. Noise from Overfitting or Excessive Detail
4. Increased Ambiguity
5. Model’s Handling of Extended Prompts
Solutions:
If you're using a custom model or platform, it might be helpful to look into whether there are specific settings or limitations that govern prompt length or how the model handles long inputs.
It sounds like you're dealing with an issue related to a watermark ("This is a fake app") that appears in the audio generated by an API, which is likely due to a security mechanism like hCaptcha that the service (SUNO Music API) uses to prevent unauthorized access. Here's a breakdown of what's happening and how you might address the situation:
1. Understand the Issue
2. Bypassing hCaptcha
3. How to Fix the Watermark Issue (If You’re a Developer)
If you are trying to fix the issue of watermarked audio (or you're developing a solution), here’s what you can try:
4. Avoiding Fake or Unofficial Solutions
Using the service mentioned in the links you shared (which claims to bypass hCaptcha and remove the watermark) could be risky:
5. Legal and Ethical Considerations
Bypassing CAPTCHAs and using cracked services can lead to unintended consequences:
If you're unsure how to navigate this or need more specific advice on integrating APIs, I can help guide you through that as well. Let me know!
Bark generates audio in chunks with a fixed maximum length. So make your prompt as long as makes sense for somebody to say within 14 seconds. It's better to use larger chunks of a few sentences if you can, rather than one sentence at a time. This gives Bark more context, and output quality will improve. Most Bark implementations have a way to handle arbitrarily long text by splitting it up according to some rules (split by sentences, up to X characters, for example) and then combining the audio chunks together. If your prompt is only two segments, you could even use the first segment's output, saved as audio and .npz, as the voice for the second segment. Doing this for more than two segments tends to go off the rails: voice changes, distortion. But it can be good to do it once. There are reasons you might use prompts that are way too long in Bark, but they are edge cases. Prompting music or prompting to find a good voice come to mind. In those cases you don't mind that most outputs will fail, and the longer prompt influences the output in a useful way. (Also, wow, bots are everywhere now...)
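The "split by sentences, up to X characters" idea could look roughly like the sketch below. It is not taken from any particular Bark implementation: the function names, the naive regex sentence splitter, and the default of 200 characters are all assumptions for illustration, and the limit should be tuned so each chunk takes no more than about 14 seconds to say.

```python
import re


def split_into_chunks(text, max_chars=200):
    """Group whole sentences into chunks of at most max_chars characters.

    max_chars=200 is an assumed placeholder, not a Bark constant.
    A single sentence longer than max_chars is kept whole, not cut.
    """
    # Naive sentence split: break on whitespace that follows ., ! or ?
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    sentences = [s.strip() for s in parts if s.strip()]

    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow: close the current chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def synthesize_long(text, synthesize, max_chars=200):
    """Call `synthesize` (any text-to-audio callable) once per chunk, in order."""
    return [synthesize(chunk) for chunk in split_into_chunks(text, max_chars)]
```

With Bark installed, `synthesize` would presumably be something like `generate_audio`, and the returned per-chunk arrays could be joined with `numpy.concatenate` before writing the result out at Bark's `SAMPLE_RATE`. Passing the same voice to every chunk should help keep the speaker consistent across segments.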
When I increase the text prompt beyond a certain size, the output contains noise and audio that is not in the text prompt at all.