Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!


Shells Virtual Desktop
BMail.ag - Secure Email Service
Server.net
CPLicense.net
VPS Server
Buy VPN
Vultr
VMs for AI
HostDare
ReliableSite White-Label Dedicated Hosting for Resellers
InterServer VPS
BMail.ag - Secure Email Service
Best VPN
High-Performance Bare Metal Server Solutions
Karvl.com
Server Mania Cloud Hosting
DataWagon Hosting
AlphaVPS Hosting
Evoxt.com
Clouvider
VPS Hosting with NVMe
Residential IPs in the US & 4G Mobile Proxies in EU & US with Unlimited Bandwidth
ReliableSite White-Label Dedicated Hosting for Resellers
Rabisu - Hosting Solutions
Shells Virtual Desktop
New on LowEndTalk? Please Register and read our Community Rules.

All new Registrations are manually reviewed and approved, so a short delay after registration may occur before your account becomes active.

AI Agents

DediRockDediRock Member, Patron Provider

Has anyone successfully written their own AI agent on their local machine? if so what computer specs did you actually need/have?

Comments

  • Not , but yes with apis

    Thanked by 1DediRock
  • I tried running the Gemma 3 4b model on an RTX 3060 12GB. The performance wasn’t fully satisfying, but it wasn’t bad either.

    Thanked by 3DediRock oloke mandala
  • DediRockDediRock Member, Patron Provider

    @woinokiz said:
    Not , but yes with apis

    gotcha have you looked into building your own on your local machine?

  • DediRockDediRock Member, Patron Provider

    @nekomikoreimu said:
    I tried running the Gemma 3 4b model on an RTX 3060 12GB. The performance wasn’t fully satisfying, but it wasn’t bad either.

    okay that's cool so what were you running your agent for? and then how did it compare to just using say chat GPT on the web browser

  • wdmgwdmg Member, LIR

    LM Studio is great if you just want to mess around. Ollama's good as well if you want a (imo) nicer API.

    Thanked by 1DediRock
  • rattlecattlerattlecattle Member
    edited July 2025

    AI agents are API calls under the hood with some added features like tool calling and mcp support which otherwise are not usually available on a chat based web interface. The API calls can be to a remote endpoint like OpenAI, OpenRouter etc or a self hosted Ollama instance - doesn't matter. Most of them are OpenAI compatible.

    To build such an agent can use simple plain HTTP requests (like python requests). Tool and mcp details can be included in the system prompt itself. Then you would need to parse the response and if check if its a tool call, execute the tool implementation yourself and send the result back to the LLM (another HTTP request actually). LLM's are stateless - do not maintain session, so in each request need to include the past chat history.

    This is too much bolierplate, so there are existing AI agent framework like langchain, CrewAI, AgnoAI, autogen on the Python side and myriad of other framework popping up almost daily.


    From the specs perspective, a small VPS instance will work fine except for running Ollama.

  • DediRockDediRock Member, Patron Provider

    @wdmg said:
    LM Studio is great if you just want to mess around. Ollama's good as well if you want a (imo) nicer API.

    okay awesome thanks yeah I used ollama 3. I created one to help me with my email accounts. But that model would mistake IP addresses and say cigars. small funny stuff like that so I looked into it and I just need a more powerful machine so I can use a different model

    Thanked by 1mandala
  • emaiIemaiI Member
    edited July 2025

    @DediRock said:

    @wdmg said:
    LM Studio is great if you just want to mess around. Ollama's good as well if you want a (imo) nicer API.

    okay awesome thanks yeah I used ollama 3. I created one to help me with my email accounts. But that model would mistake IP addresses and say cigars. small funny stuff like that so I looked into it and I just need a more powerful machine so I can use a different model

    Don't do that for anything serious... why not APIs of big providers?

    Thanked by 1DediRock
  • @DediRock said:

    @nekomikoreimu said:
    I tried running the Gemma 3 4b model on an RTX 3060 12GB. The performance wasn’t fully satisfying, but it wasn’t bad either.

    okay that's cool so what were you running your agent for? and then how did it compare to just using say chat GPT on the web browser

    I remember I once linked it with Copilot in VS Code. Honestly, the web-browser version of ChatGPT seems to perform better. The only real advantage is that you can use it offline, I guess.

    Thanked by 1DediRock
  • @DediRock said:

    @woinokiz said:
    Not , but yes with apis

    gotcha have you looked into building your own on your local machine?

    Not , but last time I was trying to clone my video then electricity had issues , never seen after that

    Thanked by 1DediRock
  • DediRockDediRock Member, Patron Provider

    @rattlecattle said:
    AI agents are API calls under the hood with some added features like tool calling and mcp support which otherwise are not usually available on a chat based web interface. The API calls can be to a remote endpoint like OpenAI, OpenRouter etc or a self hosted Ollama instance - doesn't matter. Most of them are OpenAI compatible.

    To build such an agent can use simple plain HTTP requests (like python requests). Tool and mcp details can be included in the system prompt itself. Then you would need to parse the response and if check if its a tool call, execute the tool implementation yourself and send the result back to the LLM (another HTTP request actually). LLM's are stateless - do not maintain session, so in each request need to include the past chat history.

    This is too much bolierplate, so there are existing AI agent framework like langchain, CrewAI, AgnoAI, autogen on the Python side and myriad of other framework popping up almost daily.


    From the specs perspective, a small VPS instance will work fine except for running Ollama.

    Okay well that's a little more in-depth than my current knowledge :) I will definitely research this a bit more. Okay perfect, did research about a month ago or so now I did not see anything about Lang chain crew AI or the other two you had said, however there was nothing out the box that would do what I wanted to do I use Outlook 2021 currently my local machine. So it seems like some sort of custom-built solution or some variation of it was the only way to do it I have more than just one email box I use in my Outlook, Outlook 365 has autopilot but is nowhere near what I need at least I think but thank you very much that's good stuff for me to read up on

    Thanked by 1rattlecattle
  • Sounds like you need N8N: https://github.com/n8n-io/n8n

    It has integrations for accessing services like Outlook and AI APIs so you can create workflows to read/send emails etc

    Thanked by 2DediRock whiterider
  • DediRockDediRock Member, Patron Provider

    @emaiI said:

    @DediRock said:

    @wdmg said:
    LM Studio is great if you just want to mess around. Ollama's good as well if you want a (imo) nicer API.

    okay awesome thanks yeah I used ollama 3. I created one to help me with my email accounts. But that model would mistake IP addresses and say cigars. small funny stuff like that so I looked into it and I just need a more powerful machine so I can use a different model

    Don't do that for anything serious... why not APIs of big providers?

    My understanding, you can't use the bigger ones because you had to make some big old API to call their system, it doesn't live locally on your machine. At least that was my understanding.

  • DediRockDediRock Member, Patron Provider

    @CloudHopper said:
    Sounds like you need N8N: https://github.com/n8n-io/n8n

    It has integrations for accessing services like Outlook and AI APIs so you can create workflows to read/send emails etc

    That is perfect. Thank you!

  • DediRockDediRock Member, Patron Provider

    @nekomikoreimu said:

    @DediRock said:

    @nekomikoreimu said:
    I tried running the Gemma 3 4b model on an RTX 3060 12GB. The performance wasn’t fully satisfying, but it wasn’t bad either.

    okay that's cool so what were you running your agent for? and then how did it compare to just using say chat GPT on the web browser

    I remember I once linked it with Copilot in VS Code. Honestly, the web-browser version of ChatGPT seems to perform better. The only real advantage is that you can use it offline, I guess.

    Right, but technically it would be faster if your computer was strong enough to run its own version or engine right? Instead of an API calling ChatGPT, downloading etc. Then there's the problem that ChatGPT does not store all of your data. You had 30 gigabytes of data that you wanted your Agent to pull from, ChatGPT would not be able to do that. It'd be limited correct?

  • Created with n8n on vps. Very easy.

    Thanked by 1DediRock
  • DediRockDediRock Member, Patron Provider

    @woinokiz said:

    @DediRock said:

    @woinokiz said:
    Not , but yes with apis

    gotcha have you looked into building your own on your local machine?

    Not , but last time I was trying to clone my video then electricity had issues , never seen after that

    what do you mean your electricity had issues?

  • @DediRock said:

    @woinokiz said:

    @DediRock said:

    @woinokiz said:
    Not , but yes with apis

    gotcha have you looked into building your own on your local machine?

    Not , but last time I was trying to clone my video then electricity had issues , never seen after that

    what do you mean your electricity had issues?

    You would know if you were from tier 3 country

    Thanked by 1DediRock
  • DediRockDediRock Member, Patron Provider

    @DrNutella said:
    Created with n8n on vps. Very easy.

    yes, it sounds like you're a seasoned coder though :)

    Thanked by 1DrNutella
  • @DediRock said:

    @DrNutella said:
    Created with n8n on vps. Very easy.

    yes, it sounds like you're a seasoned coder though :)

    JSON at best in this scenario

    Thanked by 1DediRock
  • Don't know i just use copilot student account.

    Thanked by 1DediRock
  • DediRockDediRock Member, Patron Provider

    @woinokiz said:

    @DediRock said:

    @woinokiz said:

    @DediRock said:

    @woinokiz said:
    Not , but yes with apis

    gotcha have you looked into building your own on your local machine?

    Not , but last time I was trying to clone my video then electricity had issues , never seen after that

    what do you mean your electricity had issues?

    You would know if you were from tier 3 country

    tracking

  • DediRockDediRock Member, Patron Provider

    @adanforest said:
    Don't know i just use copilot student account.

    gotcha that's only available though on Microsoft 365, I believe? how many email accounts do you have?

  • @DediRock said:

    @adanforest said:
    Don't know i just use copilot student account.

    gotcha that's only available though on Microsoft 365, I believe? how many email accounts do you have?

    Don't need Microsoft 365, i'm using copilot agent on VS Code

    Thanked by 1DediRock
  • Peppery9Peppery9 Member
    edited July 2025

    @DediRock said:

    @nekomikoreimu said:

    @DediRock said:

    @nekomikoreimu said:
    I tried running the Gemma 3 4b model on an RTX 3060 12GB. The performance wasn’t fully satisfying, but it wasn’t bad either.

    okay that's cool so what were you running your agent for? and then how did it compare to just using say chat GPT on the web browser

    I remember I once linked it with Copilot in VS Code. Honestly, the web-browser version of ChatGPT seems to perform better. The only real advantage is that you can use it offline, I guess.

    Right, but technically it would be faster if your computer was strong enough to run its own version or engine right? Instead of an API calling ChatGPT, downloading etc. Then there's the problem that ChatGPT does not store all of your data. You had 30 gigabytes of data that you wanted your Agent to pull from, ChatGPT would not be able to do that. It'd be limited correct?

    I think you're getting a bit mixed up with a local model verses an agent. Ollama or LM Studio are good options for running models locally - in general, you want as many GPUs with as much VRAM as you can get your hands on, and then some. Apple Silicon Macs are also a good choice with their unified memory. You can get a lot out of a small model on consumer hardware but temper your expectations accordingly, don't expect anywhere near ChatGPT-level performance or knowledge. Per-token API pricing is typically cheap enough that it's hard to justify a big investment in hardware.

    An agent on the other hand is just an LLM with tools it can use. The tools can run on your local machine even if the model isn't, and they're very lightweight as they don't do any heavy lifting. Claude Desktop and VSCode (+others) can use MCP servers to interact with local apps, databases, files, etc on your machine. There's lots to choose from, or you could always write your own.

    30GB is a lot of data for an LLM to process, and far outside any the context window of any model. Depending on what you're trying to do you might need to look into RAG.

    @DediRock said:
    Outlook 365 has autopilot but is nowhere near what I need at least I think but thank you very much that's good stuff for me to read up on

    I have a work-provided Microsoft 365 Copilot subscription. I get some AI summaries and quick reply shortcuts in Outlook and chat buttons everywhere. I find it borderline useless.

    Thanked by 1DediRock
  • not worth ROI.

    Thanked by 1DediRock
  • DediRockDediRock Member, Patron Provider

    @adanforest said:

    @DediRock said:

    @adanforest said:
    Don't know i just use copilot student account.

    gotcha that's only available though on Microsoft 365, I believe? how many email accounts do you have?

    Don't need Microsoft 365, i'm using copilot agent on VS Code

    wow you're right, I thought you needed Microsoft 365 to have co-pilot. Reading into this now, thank you

  • DediRockDediRock Member, Patron Provider

    @Peppery9 said:

    @DediRock said:

    @nekomikoreimu said:

    @DediRock said:

    @nekomikoreimu said:
    I tried running the Gemma 3 4b model on an RTX 3060 12GB. The performance wasn’t fully satisfying, but it wasn’t bad either.

    okay that's cool so what were you running your agent for? and then how did it compare to just using say chat GPT on the web browser

    I remember I once linked it with Copilot in VS Code. Honestly, the web-browser version of ChatGPT seems to perform better. The only real advantage is that you can use it offline, I guess.

    Right, but technically it would be faster if your computer was strong enough to run its own version or engine right? Instead of an API calling ChatGPT, downloading etc. Then there's the problem that ChatGPT does not store all of your data. You had 30 gigabytes of data that you wanted your Agent to pull from, ChatGPT would not be able to do that. It'd be limited correct?

    I think you're getting a bit mixed up with a local model verses an agent. Ollama or LM Studio are good options for running models locally - in general, you want as many GPUs with as much VRAM as you can get your hands on, and then some. Apple Silicon Macs are also a good choice with their unified memory. You can get a lot out of a small model on consumer hardware but temper your expectations accordingly, don't expect anywhere near ChatGPT-level performance or knowledge. Per-token API pricing is typically cheap enough that it's hard to justify a big investment in hardware.

    An agent on the other hand is just an LLM with tools it can use. The tools can run on your local machine even if the model isn't, and they're very lightweight as they don't do any heavy lifting. Claude Desktop and VSCode (+others) can use MCP servers to interact with local apps, databases, files, etc on your machine. There's lots to choose from, or you could always write your own.

    30GB is a lot of data for an LLM to process, and far outside any the context window of any model. Depending on what you're trying to do you might need to look into RAG.

    @DediRock said:
    Outlook 365 has autopilot but is nowhere near what I need at least I think but thank you very much that's good stuff for me to read up on

    I have a work-provided Microsoft 365 Copilot subscription. I get some AI summaries and quick reply shortcuts in Outlook and chat buttons everywhere. I find it borderline useless.

    yeah you're right on that, the definitions of those two need to be cleared and understood thoroughly. It seems like technical things, finding good definitions of things can be a challenge. but thank you for that, seems like trying to get towards a chat GPT type quality on your local machine is not something that's easily obtainable or done.

  • DediRockDediRock Member, Patron Provider

    @Motion3549 said:
    not worth ROI.

    yeah seems to be that way right now.

Sign In or Register to comment.