# AnyNode v0.1 (🍄 beta)
A Node for ComfyUI that does what you ask it to do
A ComfyUI Node that uses the power of LLMs to do anything with your input to make any type of output.
![image](https://github.com/lks-ai/anynode/assets/163685473/43043c8f-24f6-4693-bc9e-43666cda78b3)
![image](https://github.com/lks-ai/anynode/assets/163685473/a0596d98-911e-4a93-b0f7-6f6a8782d49d)
![image](https://github.com/lks-ai/anynode/assets/163685473/c2571a37-c1f2-4ce7-b44b-f2420fe0e3f9)
[![Watch the video](https://img.youtube.com/vi/f52K5pkbZy8/maxresdefault.jpg)](https://youtu.be/f52K5pkbZy8)
### [📺 More Tutorials on AnyNode at YouTube](https://www.youtube.com/watch?v=f52K5pkbZy8&list=PL-EiB44NKrkcxJnR9MwD4hOSZOTlHn6Tr)
### [Join our Discord](https://discord.gg/RFpe6gsK5x)
## Install
1. Clone this repository into `comfy/custom_nodes` *or* just search for `AnyNode` in ComfyUI Manager
2. If you're using the OpenAI API, follow the OpenAI Instructions below
3. If you're using Gemini, follow the Gemini Instructions
4. If you're using a local LLM, make sure your OpenAI-compatible LLM server (Ollama, etc.) is running
5. Restart Comfy
6. In ComfyUI, double-click the canvas and search for `AnyNode`, or find it under Nodes > utils
### OpenAI Instructions
1. Make sure you have the `openai` module installed through pip: `pip install openai`
2. Add your `OPENAI_API_KEY` variable to your Environment Variables. [How to get your OpenAI API key](https://platform.openai.com/docs/quickstart)
`AnyNode 🍄` is the node that uses OpenAI directly with the latest ChatGPT model (whichever that may be at the time)
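If you want to double-check that the key is actually visible to the process that launches ComfyUI, here's a minimal sketch; it assumes only the `OPENAI_API_KEY` variable name from the step above:

```python
import os

# Quick sanity check that the key is visible to the process that launches ComfyUI.
key = os.environ.get("OPENAI_API_KEY")
if key:
    print(f"OPENAI_API_KEY found (length {len(key)})")
else:
    print("OPENAI_API_KEY is not set; export it before starting ComfyUI")
```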
### Gemini Instructions
1. You don't need any extra module, so don't worry about that
2. Add your `GOOGLE_API_KEY` variable to your Environment Variables. [How to get your Google API key](https://aistudio.google.com/app/apikey)
`AnyNode 🍄 (Gemini)` is still being tested, so it probably contains bugs. Updates are ongoing.
## Local LLMs
![Screenshot from 2024-05-27 13-32-58](https://github.com/lks-ai/anynode/assets/163685473/70cb508e-b2af-470a-b777-1ddebe1cd59c)
We now have an `AnyNode 🍄 (Gemini)` Node and our big star: The `AnyNode 🍄 (Local LLM)` Node.
This was the most requested feature since Day 1. The classic `AnyNode 🍄` will still use OpenAI directly.
- You can set each Local LLM node to use a different local or hosted service, as long as it's OpenAI compatible
- This means you can use [Ollama](https://ollama.com/), [vLLM](https://github.com/vllm-project/), or any other local LLM server, wherever it's hosted (see the sketch below)
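For what "OpenAI compatible" means in practice, here's a minimal sketch that points the standard `openai` client at a local Ollama server. The `base_url`, dummy `api_key`, and model name are assumptions about one possible setup, not requirements:

```python
from openai import OpenAI

# Sketch: talking to a local, OpenAI-compatible server (here: Ollama's default
# port). AnyNode's `server` field points at this same kind of endpoint.
client = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")

response = client.chat.completions.create(
    model="llama3",  # whatever model your local server actually has loaded
    messages=[{"role": "user", "content": "Write a Python function that doubles a number."}],
)
print(response.choices[0].message.content)
```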
### A Note about Security for the Local LLM variant
AnyNode works by executing Python code that comes back from the `server` you point it at via a Chat Completions endpoint. To put that into perspective: whatever address you put into `server` is effectively being given some control over Python on your machine. **BE CAREFUL**: if you are not pointing it at `localhost`, make absolutely sure you trust the address you put into `server`.
## How it Works
1. Put in what you want the node to do with the input and output.
2. Connect it up to anything on both sides
3. Hit `Queue Prompt` in ComfyUI
AnyNode writes a Python function based on your request and whatever input you connect to it, generating the output you asked for, which you can then connect to compatible nodes.
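To give a concrete (and purely illustrative) sense of scale: a prompt like "multiply the input by 5" would have the LLM return something along these lines. The function name and body below are hypothetical, not literal AnyNode output:

```python
def generated_function(input_data):
    """Hypothetical example of the kind of function AnyNode asks the LLM to
    write: take whatever arrives on the input, return the requested output."""
    return input_data * 5
```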
![image](https://github.com/lks-ai/anynode/assets/163685473/1245aa94-fa4d-4490-a3f4-5e8b9918ca28)
## Update: It can make you a sandwich
![Courtesy of Synthetic Ape](https://github.com/lks-ai/anynode/assets/163685473/fc40a7fe-61d4-4347-aeec-80d5b34ddf4f)
*Courtesy of [synthetic ape](https://www.tiktok.com/@synthetic.ape)*
**Warning**: Because of the ability to link *ANY* node, you can crash ComfyUI if you are not careful.
### 🛡️ Security Features
You shouldn't trust an LLM with your computer, and we don't either.
**Code Sanitizer**
Every piece of code that the LLM outputs [goes through a sanitizer](https://github.com/lks-ai/anynode/blob/d2d466bdcf6ef162aef503b3c79e135b7a33c349/nodes/utils.py#L149) before being allowed to be loaded into the environment or executed. You will see errors about dangerous code... that's the sanitizer.
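As a rough mental model only (the real checks live in the `utils.py` linked above), a sanitizer of this kind scans the generated source for dangerous constructs before anything is loaded. The sketch below is an assumption about the general technique, not AnyNode's actual rules:

```python
import re

# Illustrative sketch of a code sanitizer, NOT AnyNode's actual implementation
# (see the utils.py link above): reject generated code that contains obviously
# dangerous constructs before it is ever loaded or executed.
DANGEROUS_PATTERNS = [
    r"\bos\.system\b", r"\bsubprocess\b", r"\beval\s*\(", r"\bexec\s*\(",
    r"\bopen\s*\(", r"\b__import__\b", r"\bsocket\b",
]

def is_dangerous(source_code: str) -> bool:
    return any(re.search(pattern, source_code) for pattern in DANGEROUS_PATTERNS)

assert is_dangerous("import subprocess; subprocess.run(['ls'])")
assert not is_dangerous("def generated_function(x):\n    return x * 5")
```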
**No Internet, No Files, No Command Line**
As a safety feature, AnyNode does not have the ability to generate functions that browse the internet or touch the files on your computer. If you need to load something into comfy or get stuff from the internet, there are plenty of loader nodes available in popular node packs on Manager.
**Curated Imports**
We only let AnyNode use libraries from the list of [Allowed Imports](https://github.com/lks-ai/anynode/blob/d2d466bdcf6ef162aef503b3c79e135b7a33c349/nodes/any.py#L120). Anything else will not even be within the function's runtime environment and will give you an error. This is a feature. If you want libraries you don't see in that list to be added to AnyNode, let us know on the Discord or open an Issue.
**Note**: AnyNode can use the `openai` and Google generative AI libraries in the functions it generates, so you can paste an example from OpenAI's API docs, ask it to use the latest from OpenAI, and have it stream a TTS audio file to your computer; that library is supported, and it's fine.
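Conceptually, curated imports amount to exposing only a whitelist of modules to the namespace the generated function runs in. The sketch below illustrates that idea with a made-up, much shorter allow-list than the real one linked above:

```python
import collections
import re

import numpy

# Illustrative sketch of curated imports, NOT AnyNode's actual allow-list
# (see the any.py link above). Only whitelisted modules are placed into the
# namespace the generated function runs in; anything else raises a NameError.
ALLOWED_MODULES = {"numpy": numpy, "re": re, "collections": collections}

generated_source = """
def generated_function(x):
    return numpy.clip(x, 0, 1)
"""

namespace = dict(ALLOWED_MODULES)            # only curated modules are visible
exec(generated_source, namespace)            # load the generated function
print(namespace["generated_function"](1.5))  # -> 1.0
```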
## 🤔 Caveats
- I have no idea how far you can take this, nor what its limits are
- LLMs can't read your mind. To make complex stuff in one node, you'd have to know a bit about programming
- The smaller the LLM you use to code your nodes, the less coding skills it might have
- Right now you can only see code the LLM generates in the console
- ~~Can't make a sandwich~~
## 💪 Strengths
- Use OpenAI `AnyNode 🍄`, Local LLMs `AnyNode 🍄 (Local LLM)`, Gemini `AnyNode 🍄 (Gemini)`
- You can use as many of these as you want in your workflow, creating possibly complex node groups
- Really great at single purpose nodes
- Uses OpenAI API for simple access to the latest and greatest in generation models
- Technically you could point this at vLLM, LM Studio, or Ollama, for you local LLM fans
- Can use most popular Python libraries and much of the standard library (numpy, torch, collections, re)
- Ability to make more complex nodes that use inputs like MODEL, VAE and CLIP with input type awareness
- Error Mitigation: Auto-correct errors it made in code (just press `Queue Prompt` again)
- Incremental Code editing (the last generated function serves as example for next generation)
- Copying cool nodes you prompt is as easy as copying the workflow
- Saves a generated-function registry `json` to `output/anynode` so you can bundle it with the workflow
- Can make more complex functions with two optional inputs to the node.
- **IT CAN MAKE A SANDWICH!**
## 🛣️ Roadmap
- **Export to Node**: Compile a new comfy node from your AnyNode (Requires restart to use your new node)
- Downstream Error Mitigation: Perform error mitigation on outputs to other nodes (expectation management)
- RAG based function storage and semantic search across comfy modules (not a pipe dream)
- Persistent data storage in the AnyNode (functions store extra data for iterative processing or persistent memory)
- Expanding [*NodeAware*](https://github.com/lks-ai/anynode/blob/main/nodes/util_nodeaware.py#L1) to include full Workspace Awareness
- Node Recommendations: AnyNode will recommend nodes, or even load some into the workflow, based on your input
## Coding Errors you Might Encounter
As with any LLM or text-generating language model, when it comes to coding it can sometimes make mistakes that it can't fix by itself, even if you show it the error of its ways. A lot of these can be mitigated by modifying your prompt. If you encounter one of the known ones, we have some prompt-engineering solutions for you.
For this, I recommend that you [join our Discord](https://discord.gg/RFpe6gsK5x) and report the bug there. Oftentimes AnyNode will fix a bug that happened within your generated function if you just click `Queue Prompt` again.
## If you're still here
Let's enjoy some stuff I made while up all night!
![image](https://github.com/lks-ai/anynode/assets/163685473/02801f5c-9f67-40f1-83a7-a93e6103d362)
This one, well... the prompts explain it all, but TL;DR: it takes an image as input and outputs only the red channel of that image.
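For reference, the generated function probably ends up doing something roughly like this (a sketch assuming ComfyUI's usual IMAGE convention of a `[batch, height, width, channels]` float tensor in 0..1; not the literal generated code):

```python
import torch

def red_channel_only(image: torch.Tensor) -> torch.Tensor:
    """Sketch of a 'keep only the red channel' transform, assuming ComfyUI's
    IMAGE convention of a [batch, height, width, channels] float tensor in 0..1."""
    result = torch.zeros_like(image)
    result[..., 0] = image[..., 0]  # copy red, leave green and blue at zero
    return result
```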
![Screenshot from 2024-05-26 01-30-40](https://github.com/lks-ai/anynode/assets/163685473/4cfe5b0b-d515-4f9d-9d86-eff1a08595ed)
Here I use three AnyNodes: One to load a file, one to summarize the text in that file, and the other to just do some parsing of that text. No coding needed.
![image](https://github.com/lks-ai/anynode/assets/163685473/4bc5c6c0-ca56-4f4c-88d5-5339b6d5ada1)
I took that Ant example a bit further and added the normal nodes to do img2img with my color transforms from AnyNode.
![Screenshot from 2024-05-26 20-45-57](https://github.com/lks-ai/anynode/assets/163685473/0e02ae11-7e46-4d50-8645-fe7a5d3c46c9)
Here I ask for an Instagram-like sepia-tone filter in my AnyNode... I titled the node "Image Filter" just so I can remember what it's supposed to be doing in the workflow.
![image](https://github.com/lks-ai/anynode/assets/163685473/b8879685-6a78-4314-a8e4-5d88d046621d)
Let's try a much more complex description of an HSV transform, but still in plain English. And we get a node that will randomly tweak the HSV every time it's run!
[Here's that workflow](workflows/anynode_hsl-tweak.json)
![Screenshot from 2024-05-26 21-05-25](https://github.com/lks-ai/anynode/assets/163685473/c00531c9-c93a-471a-bca0-bb62abea4943)
Then I ask for a more legacy Instagram filter (normally that would pop the saturation and warm up the light, which it did!)
![image](https://github.com/lks-ai/anynode/assets/163685473/dda13811-7e0e-4d9e-ab7c-fd2ff3d594ba)
How about a psychedelic filter?
![image](https://github.com/lks-ai/anynode/assets/163685473/29db4cd9-db77-4931-a340-10755e0211fa)
Here I ask it to make a "sota edge detector" for the output image, and it makes me a pretty cool Sobel filter. And I pretend that I'm on the moon.
[Here's that workflow](workflows/sobel-charcoal.json)
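If you're curious what a generated edge detector might look like, here's a hedged sketch of a Sobel filter over the same assumed `[batch, height, width, channels]` image layout; it illustrates the technique, not the code AnyNode actually produced:

```python
import torch
import torch.nn.functional as F

def sobel_edges(image: torch.Tensor) -> torch.Tensor:
    """Sketch of a Sobel edge detector over a [batch, height, width, channels]
    float image tensor; an illustration, not AnyNode's generated code."""
    gray = image.mean(dim=-1, keepdim=True)             # [B, H, W, 1] grayscale
    gray = gray.permute(0, 3, 1, 2)                     # [B, 1, H, W] for conv2d
    kx = torch.tensor([[-1., 0., 1.],
                       [-2., 0., 2.],
                       [-1., 0., 1.]]).view(1, 1, 3, 3)  # horizontal gradient kernel
    ky = kx.transpose(2, 3)                              # vertical gradient kernel
    gx = F.conv2d(gray, kx, padding=1)
    gy = F.conv2d(gray, ky, padding=1)
    edges = torch.sqrt(gx ** 2 + gy ** 2).clamp(0, 1)    # gradient magnitude
    return edges.permute(0, 2, 3, 1).repeat(1, 1, 1, 3)  # back to [B, H, W, 3]
```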