The error message you’re encountering indicates that you’ve exceeded the monthly credit allowance for Inference Providers on your Hugging Face Pro account:
> RuntimeError: API error: 402 - {“error”:“You have exceeded your monthly included credits for Inference Providers. Pay-as-you-go above your included PRO quota will be available soon.”}
This suggests that your current usage has surpassed the credits included in your subscription plan. To address this issue and continue generating inferences, consider the following options:
-
Upgrade Your Subscription:
- Review your current usage and determine if upgrading to a higher-tier plan aligns with your needs. Upgraded plans typically offer increased credit allowances, allowing for more extensive usage.
-
Wait for the Next Billing Cycle:
- Your monthly credits reset at the start of each billing cycle. If your usage is expected to decrease or if you can postpone certain tasks, waiting until your credits renew might be a viable option.
-
Contact Hugging Face Support:
- Reach out to Hugging Face’s support team to discuss your current usage and explore potential solutions. They may offer temporary accommodations or provide guidance tailored to your situation.
-
Monitor and Optimize Usage:
- Implement monitoring tools to track your API usage. By analyzing this data, you can identify patterns and optimize your workflows to be more efficient, potentially reducing unnecessary API calls.
-
Explore Alternative Solutions:
- If immediate generation is critical and other options are not feasible, consider using local inference by deploying models on your infrastructure. This approach can reduce reliance on external API calls and provide more control over your usage.
It’s important to note that Hugging Face has acknowledged that pay-as-you-go options beyond the included Pro quota will be available soon. Keeping an eye on official announcements or contacting support can provide updates on when this feature becomes available, offering more flexibility in managing your usage.
By evaluating these options, you can determine the best course of action to resume your inference tasks effectively.