AI agents run my one-person company on Gemini's free tier – $0/month

by ppcvote·Mar 8, 2026·16 points·35 comments

AI Analysis

Banger · Big Brain · Ship It · Solve My Problem

$0/month AI ops fleet with token-optimization tricks and real production metrics.

Strengths
  • Token-optimization architecture (pre-computed intelligence files, zero-conversation design) genuinely reduces Gemini 2.5 Flash usage to ~105 requests daily.
  • Ships 4 live products (MindThread SaaS, AI Agent fleet, UltraProbe, Multi-LLM routing) with verifiable metrics: 923K+ views, 20+ accounts, zero downtime.
  • Agents run on WSL2 at home via systemd timers—no cloud dependency, reproducible for others to fork.
Weaknesses
  • Documentation focuses on marketing (in a Mandarin/English mix) rather than technical architecture; it is hard to extract the actual agent prompts or retry logic from the landing page.
  • No published code repository or GitHub link despite OpenClaw being listed as 'open source'; limits ability to fork or verify claims.
Target Audience

Solo founders, small tech agencies, developers building cost-efficient AI automation

Similar To

Continue.dev (Copilot for IDE) · Cursor (context-aware coding) · n8n (workflow automation)

Post Description

I'm a solo dev in Taiwan. I built 4 AI agents that handle content, sales leads, security scanning, and ops for my tech agency — all on Gemini 2.5 Flash free tier (1,500 req/day). I use ~105. Monthly LLM cost: $0.

Architecture: 4 agents on OpenClaw (open source), running on WSL2 at home with 25 systemd timers.

What they do every day:

- Generate 8 social posts across platforms (quality-gated: generate → self-review → rewrite if score < 7/10)
- Engage with community posts and auto-reply to comments (context-aware, max 2 rounds)
- Research via RSS + HN API + Jina Reader → feed intelligence back into content
- Run UltraProbe (AI security scanner) for lead generation
- Monitor 7 endpoints, flag stale leads, sync customer data
- Auto-post blog articles to Discord when I git push (0 LLM tokens; uses the commit message directly)
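
The quality gate in the first item (generate → self-review → rewrite if score < 7/10) can be sketched roughly as follows. This is an illustration only, not the code from the post: `generate`, `review`, and `rewrite` are hypothetical callables standing in for whatever LLM calls the agents actually make, and the single rewrite pass is an assumption.

```python
from typing import Callable


def quality_gated_post(
    generate: Callable[[str], str],
    review: Callable[[str], float],
    rewrite: Callable[[str, float], str],
    topic: str,
    threshold: float = 7.0,
) -> tuple[str, int]:
    """Return (final_post, llm_calls_used) for one topic.

    All three callables are hypothetical placeholders for LLM requests.
    """
    draft = generate(topic)   # request 1: write the draft
    score = review(draft)     # request 2: self-review, score out of 10
    calls = 2
    if score < threshold:     # rewrite only when the draft scores below 7/10
        draft = rewrite(draft, score)
        calls += 1            # request 3: a single rewrite pass (assumed)
    return draft, calls
```

Under these assumptions each post costs two requests, or three when the gate triggers a rewrite.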

The token optimization trick: agents never have long conversations. Every request is (1) read pre-computed intelligence files (local markdown, 0 tokens), (2) one focused prompt with all context injected, (3) one response → parse → act → done. The research pipeline (RSS, HN, web scraping) costs 0 LLM tokens — it's pure HTTP + Jina Reader. The LLM only touches creative/analytical work.
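
A minimal sketch of that read → prompt → parse → act cycle, under stated assumptions: the directory of markdown files, the JSON reply format, and the `call_llm` and `act` callables are all hypothetical and are not taken from the post or from OpenClaw.

```python
import json
from pathlib import Path
from typing import Callable


def build_prompt(task: str, intel_dir: Path, max_chars: int = 4000) -> str:
    """Inject pre-computed markdown files into one focused prompt."""
    sections = []
    for path in sorted(intel_dir.glob("*.md")):
        # Reading local files makes no LLM request; truncate to bound prompt size.
        text = path.read_text(encoding="utf-8")[:max_chars]
        sections.append(f"## {path.stem}\n{text}")
    context = "\n\n".join(sections)
    return f"{context}\n\n# Task\n{task}\nReply with a single JSON object."


def run_once(
    task: str,
    intel_dir: Path,
    call_llm: Callable[[str], str],
    act: Callable[[dict], None],
) -> dict:
    """One request, one response, no conversation history."""
    prompt = build_prompt(task, intel_dir)
    raw = call_llm(prompt)     # exactly one LLM request per task
    result = json.loads(raw)   # parse (assumes the model replied with JSON)
    act(result)                # act, then done
    return result
```

Because no chat history is carried between runs, the request count per task stays fixed at one regardless of how many intelligence files are injected.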

Real numbers:

- 27 automated Threads accounts, 12K+ followers, 3.3M+ views
- 25 systemd timers, 62 scripts, 19 intelligence files
- RPD utilization: 7% (105/1,500), 93% headroom left
- Monthly cost: $0 LLM + ~$5 infra (Vercel hobby + Firebase free)
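
The utilization figure follows directly from the two request counts quoted in the post; the helper name below is made up for illustration.

```python
def rpd_utilization(used: int, limit: int) -> tuple[float, float]:
    """Return (percent of daily requests used, percent headroom)."""
    utilization = used / limit * 100
    return utilization, 100 - utilization


# Figures from the post: ~105 requests used out of a 1,500 requests/day limit.
used_pct, headroom_pct = rpd_utilization(105, 1500)
remaining = 1500 - 105
print(f"{used_pct:.0f}% used, {headroom_pct:.0f}% headroom, {remaining} requests left")
# prints "7% used, 93% headroom, 1395 requests left"
```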

What went wrong:

- $127 Gemini bill in 7 days. Created an API key from a billing-enabled GCP project instead of AI Studio. Thinking tokens ($3.50/1M) with no rate cap. Lesson: always create keys from AI Studio directly.
- Engagement loop bug: iterated ALL posts instead of top N. Burned 800 RPD in one day and starved everything else.
- Telegram health check called getUpdates, conflicting with the gateway's long-polling. 18 duplicate messages in 3 minutes.
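
One way to guard against the engagement-loop failure is to cap the candidate list at the top N and give each agent its own daily request budget, so a runaway loop cannot starve the others. The sketch below is a hypothetical guard, not the fix described in the post: the `score` field, the `DailyBudget` class, and the `reply` callable are all assumptions.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class DailyBudget:
    """Per-agent request allowance (hypothetical; reset once per day)."""
    limit: int
    used: int = 0

    def try_spend(self, n: int = 1) -> bool:
        if self.used + n > self.limit:
            return False
        self.used += n
        return True


def pick_posts_to_engage(posts: list[dict], top_n: int) -> list[dict]:
    """Keep only the top N posts by an assumed 'score' field."""
    ranked = sorted(posts, key=lambda p: p["score"], reverse=True)
    return ranked[:top_n]


def engage(
    posts: list[dict],
    budget: DailyBudget,
    reply: Callable[[dict], None],
    top_n: int = 5,
) -> int:
    """Reply to at most top_n posts, stopping early if the budget runs out."""
    replied = 0
    for post in pick_posts_to_engage(posts, top_n):
        if not budget.try_spend():
            break              # out of budget: leave the rest for tomorrow
        reply(post)            # one LLM request per reply (assumed)
        replied += 1
    return replied
```

With both limits in place, the worst case for this loop is `min(top_n, budget.limit)` requests per run, whatever the size of the input list.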

The site (https://ultralab.tw) is fully bilingual (zh-TW/en) with 21 blog posts, and yes — the i18n, blog publishing, and Discord notifications are all part of the automated pipeline.

Live agent dashboard: https://ultralab.tw/agent

Stack: OpenClaw, Gemini 2.5 Flash (free), WSL2/systemd, React/TypeScript/Vite, Vercel, Firebase, Telegram Bot, Resend, Jina Reader.

GitHub (playbook): https://github.com/UltraLabTW/free-tier-agent-fleet

Happy to answer questions about the architecture, token budgeting, or what it's actually like running AI agents 24/7 as a one-person company.

Similar Projects

SaaSPass

Running your own AI assistant for €19/month

The post nails the hard-to-communicate bit: real costs and failure modes. It lays out a concrete €19/month hosting option, shows how Google Gemini's free tier and cheaper embedding/semantic search options can keep API bills tiny, and calls out the real upkeep costs people underestimate (backups, lost conversation history). It’s more a practical buyer’s guide + hosting pitch than a technical breakthrough, but the numbers and comparisons are legitimately useful if you’re on the fence about DIY.

Solve My Problem · Niche Gem
yixn_io
113mo ago

Galatea – Real personality for your AI agent

Galatea maps personality descriptions (everything from 'Sherlock Holmes' to original personas) into concrete, agent-specific config files like CLAUDE.md, .cursorrules and GEMINI.md, plus preview interactions and a ZIP you can drop into a project — that practical, export-first flow is the product's strongest move. It isn't a category killer (persona/prompt managers already exist), but the per-agent format mapping and preview/download pipeline make it a useful, tactile tool; watch out for copyright and ethics when reusing famous personalities.

Niche Gem · Slick
ianpcook
313mo ago