Use the powerful GPT-4, Llama-3 and more AI models on Raycast, for FREE - no API Key required.
Homepage · Privacy Policy
"If you like the extension, please consider giving it a ✨star✨ tysm!" - the developer, probably
This extension is currently not available on the Raycast Extension store, but installation from source is extremely simple.
- Clone the repository / download source code from GitHub.
- Navigate to the directory, and open a Terminal window at the downloaded folder.
- Run
npm ci --production
to install required dependencies. - Run
npm run dev
to build and import the extension.
The extension, and its full set of commands, should then show up in your Raycast app.
Please open an issue if any unexpected problems occur during installation.
There is built-in support for updating within the extension itself! Simply run the "Check for Updates" command in the extension, and it will take care of the update process for you. Furthermore, you can also enable the "Automatically Check for Updates" feature in the preferences.
In the command line, run git pull
, npm ci --production
and npm run build
(in that order).
You might want to update manually if the automatic update doesn't work (please also open a GitHub issue if this is the case); updating manually also allows you to fetch and view the latest changes to the source code.
▶️ ️ Streaming support - see messages load in real-time, providing a seamless experience.- ⚡ Ask anything from anywhere - with 18 commands available, there's something for you no matter what you need.
- 💪 Support for many providers & models (more info below!)
- 💬 Chat command - interact with the AI in a conversation, and your chat history will be stored in the extension.
- 🌐 Web search - let GPT search the web for the latest information.
- 📄 File upload - you can upload image, video, audio and text files to the AI. (only available for a few providers, more to come!)
- 🎨 Image generation capabilities - imagine anything, and make it reality with state-of-the-art models.
- ✏️ Custom AI Commands - create your own commands with custom prompts!
Provider | Model | Features | Status | Speed | Rating and remarks by extension author |
---|---|---|---|---|---|
Nexra | gpt-4o (default) | Very fast | 8.5/10, the best performing model. | ||
Nexra | gpt-4-32k | Medium | 6.5/10, no streaming support but otherwise a great model. | ||
Nexra | chatgpt | Very fast | 7.5/10 | ||
Nexra | Bing | Medium | 8/10, GPT-4 based with web search capabilities. | ||
Nexra | llama-3.1 | Fast | 7/10 | ||
Nexra | gemini-1.0-pro | Fast | 6.5/10 | ||
DeepInfra | meta-llama-3.1-405b | Medium | 8.5/10, state-of-the-art open model, suitable for complex tasks. | ||
DeepInfra | meta-llama-3.1-70b | Fast | 8/10, recent model with large context size. | ||
DeepInfra | meta-llama-3.1-8b | Very fast | 7.5/10, recent model with large context size. | ||
DeepInfra | Mixtral-8x22B | Fast | 7.5/10, capable model for general use. | ||
DeepInfra | Mixtral-8x7B | Very fast | 7/10 | ||
DeepInfra | Qwen2.5-72B | Medium | 7.5/10 | ||
DeepInfra | Mistral-7B | Very fast | 6.5/10 | ||
DeepInfra | openchat-3.6-8b | Very fast | 7/10 | ||
DeepInfra | meta-llama-3-70b | Medium | 7/10 | ||
DeepInfra | meta-llama-3-8b | Very fast | 6/10 | ||
DeepInfra | gemma-2-27b | Very fast | 6.5/10 | ||
DeepInfra | WizardLM-2-8x22B | Medium | 7/10 | ||
DeepInfra | llava-1.5-7b | Fast | 6/10, supports image input | ||
Blackbox | custom model | Fast | 7.5/10, very fast generation with built-in web search ability, but is optimized for coding. | ||
Blackbox | llama-3.1-405b | Fast | 8.5/10 | ||
Blackbox | llama-3.1-70b | Very fast | 8/10 | ||
Blackbox | gemini-1.5-flash | Extremely fast | 7.5/10 | ||
Blackbox | gpt-4o | Very fast | 7.5/10 | ||
Blackbox | claude-3.5-sonnet | Fast | 8.5/10 | ||
Blackbox | gemini-pro | Fast | 8/10 | ||
DuckDuckGo | gpt-4o-mini | Very fast | 8/10, authentic GPT-4o-mini model with strong privacy. | ||
DuckDuckGo | claude-3-haiku | Extremely fast | 7/10 | ||
DuckDuckGo | meta-llama-3.1-70b | Very fast | 7.5/10 | ||
DuckDuckGo | mixtral-8x7b | Extremely fast | 7.5/10 | ||
BestIM | gpt-4o-mini | Extremely fast | 8.5/10 | ||
PizzaGPT | gpt-3.5-turbo | Extremely fast | 6.5/10 | ||
Rocks | claude-3.5-sonnet | Fast | 8.5/10 | ||
Rocks | claude-3-opus | Fast | 8/10 | ||
Rocks | gpt-4o | Fast | 7.5/10 | ||
Rocks | gpt-4 | Fast | 7.5/10 | ||
Rocks | llama-3.1-405b | Fast | 7.5/10 | ||
Rocks | llama-3.1-70b | Very Fast | 7/10 | ||
Meta AI | meta-llama-3.1 | Medium | 7/10, recent model with internet access. | ||
Replicate | mixtral-8x7b | Medium | ?/10 | ||
Replicate | meta-llama-3.1-405b | Medium | ?/10 | ||
Replicate | meta-llama-3-70b | Medium | ?/10 | ||
Replicate | meta-llama-3-8b | Fast | ?/10 | ||
Google Gemini | auto (gemini-1.5-pro, gemini-1.5-flash) | Very fast | 8/10, very good overall model but requires an API Key. (It's free, see the section below) | ||
GPT4Free Local API | - | - | allows access to a large variety of providers. read more | ||
Ollama Local API | - | - | allows local inference. read more | ||
Custom OpenAI-compatible API | - | - | allows you to use any custom OpenAI-compatible API. read more |
📄 - Supports file upload.
¹: Supports images only.
- Google Gemini: An API Key is required to use this model. You can get one completely for free:
- Go to https://aistudio.google.com/app/apikey
- Sign in to your Google account if you haven't done so.
- Click on "Create API Key" and follow the instructions there.
- Copy the API Key and paste it into the corresponding box in the extension preferences.
The rate limit for Google Gemini is 1500 requests per day (as of the time of writing). This should be much more than enough for any normal usage. If your use case needs an increased rate limit, you can even create multiple API Keys with different Google accounts; separate them with commas in the preferences.
- Google Gemini: This provider supports File upload functionality, as well as the Ask About Screen Content command! To upload a file in AI Chat, press Command-Enter or select "Compose Message" from the actions. Then, simply click on the upload button to get started.
Let GPT decide to search the web for information if it does not have enough knowledge or context.
- How to enable: Go to the extension preferences and check the "Enable Web Search" box.
- Setting up: An API Key is required to use this feature. You can get one completely for free:
- Go to https://app.tavily.com
- Sign in with a Google/GitHub account.
- Copy the API Key and paste it into the corresponding box in the extension preferences.
The rate limit for the web search feature is around 500 requests per month (as of the time of writing). This should be enough for most users, but similarly you can also create multiple API Keys with different accounts; separate them with commas in the preferences.
Let GPT automatically come up with a name for the current chat session after you send the first message. For example, this is similar to what the ChatGPT web UI does.
Let the extension automatically check for updates every day. If a new version is available, you will be notified, along with the option to update the extension with a single click.
Enable more persistent storage of the extension's data, like AI Chat data or Custom Commands. This will back up a copy of this data to files on your computer. Useful for saving large amounts of data. Note: With this option off, your data is already well preserved. Do not enable this if you have sensitive data.
Allows GPT to execute Python code locally. The model has been instructed to strictly only produce safe code, but use at your own risk!
Only models with function calling capabilities support this feature. Currently, this includes only selected DeepInfra models.
- I submitted it when the extension was in its early stages, but since Raycast is quite conservative about adding AI extensions, it was rejected because of concerns over the use of third-party APIs. I think this is a valid concern, and I'd like to be very transparent about where your data is going to, so I'll be writing all the details in the project homepage really soon. But do rest assured that I use safe APIs from legitimate websites; I also update the extension very frequently so all the providers I use will be up-to-date.
- Thus, the extension will have to be installed from source. Regarding this, I apologize as it's indeed more complicated than downloading it from the store. I have tried my best to make the installation process quick and streamlined - please do provide feedback on whether it was simple enough!
- First and foremost, raycast-g4f is completely free! I strongly believe that such AI features, especially in an awesome productivity tool like Raycast, shouldn't be locked behind a paywall.
- UI-wise, the biggest difference is probably the chat GUI. Because the developer API that Raycast provides is limited, it's not possible to replicate the Raycast AI interface exactly. The GUI used in my extension is really intuitive however, and it's also used by a lot of AI extensions on raycast.
- Regarding the models available, and the AI quality: Raycast AI definitely has more model options - after all, money talks ;) But please rest assured that the quality of AI responses in my extension is by no means subpar! Some models available are gpt-3.5-turbo, gpt-4 (authentic!), Google Gemini, as well as large & capable open-source models like Llama 3.1 and Mixtral-8x22B. If you still doubt it, I'd encourage you to give the extension a try! :-)
License: GPLv3. Full license is found in LICENSE.txt.
The code base is derived from Raycast Gemini by Evan Zhou.
Third-party libraries used for generation:
(Both packages are maintained by the extension author.)
Some of the code in this repository was inspired or ported from the original gpt4free project (written in Python).