Merge issues fixed

Fix error handling, resource leaks, and logic bugs across the SDK
Bugs fixed: - files.py: use typed error checking (_raise_for_status) instead of raw raise_for_status(), ensuring WrennNotFoundError etc. are raised correctly - exceptions.py: check both "capsule_ids" and "sandbox_ids" response keys for backwards compatibility - code_interpreter: retry _ensure_kernel on 5xx errors (only fail on 4xx), remove redundant TimeoutError in bare except, clean up non-standard top-level msg_id/msg_type from Jupyter messages Resource leaks fixed: - capsule.py: close WrennClient if capsule creation or init fails - code_interpreter: add close()/__del__ for _proxy_client cleanup when not using context manager Logic fixes: - pty.py: yield exit events to callers instead of silently discarding them - capsule.py: auto-resume paused capsules in wait_ready instead of failing - capsule.py: log warnings on destroy failure in __exit__ instead of silently swallowing errors
2026-05-02 21:46:16 +06:00 · 2026-05-02 21:34:02 +06:00 · 2026-05-02 19:02:39 +06:00 · 2026-04-24 00:01:20 +06:00 · 2026-04-23 23:53:15 +06:00 · 2026-04-23 12:38:41 +00:00
2 changed files with 1 additions and 172 deletions
--- a/CLAUDE.md
+++ b/CLAUDE.md
@ -1,171 +0,0 @@
-## Design Context
-
-### Users
-Developers across the full spectrum — solo engineers building side projects, startup teams integrating sandboxed execution into products, and platform/infra engineers at larger organizations running production workloads on Firecracker microVMs. They arrive with context: they know what a process is, what a rootfs is, what a TTY means. The interface must feel at home for all three: approachable enough not to intimidate a hacker, precise enough to earn the trust of a production ops team. Never condescend, never oversimplify. Trust the user to understand what they're looking at.
-
-**Primary job to be done:** Understand what's running, act on it confidently, and get back to code.
-
-### Brand Personality
-**Precise. Warm. Uncompromising.**
-
-Wrenn is an engineer's favorite tool — built with visible care, not assembled from defaults. It runs real infrastructure (Firecracker microVMs), so the UI should reflect that seriousness without becoming cold or corporate. The warmth comes from the typography and color palette; the precision comes from hierarchy, density, and data fidelity.
-
-Emotional goal: **in control.** Users leave a session with full confidence in what's running, what happened, and what comes next. Nothing is hidden, nothing is ambiguous.
-
-### Aesthetic Direction
-**Dark-only (permanently), industrial-warm, data-forward.**
-
-No light mode planned. All design decisions should optimize for dark. The near-black-green background palette (`#0a0c0b` through `#2a302d`) reads as "black with intention" — not pitch black (cold) and not charcoal (dated). The sage green accent (`#5e8c58`) is muted and organic, a meaningful departure from the startup-green neon that saturates the developer tool space.
-
-**Anti-references:**
- **Supabase**: avoid the friendly, approachable startup-green energy — too generic, too eager to please
- **AWS / GCP consoles**: avoid utility-first density without craft — functional but joyless, visually dated
-
-**References that capture the right spirit:**
- The precision of a well-calibrated instrument
- Editorial typography from technical publications
- The quiet confidence of tools that don't need to explain themselves
-
-### Type System
-Four fonts with strict roles — this is the design system's strongest personality trait and must be respected:
-
-| Font | CSS Class | Role | When to use |
-|------|-----------|------|-------------|
-| **Manrope** (variable, sans) | `font-sans` | UI workhorse | All body copy, nav, labels, buttons, form text |
-| **Instrument Serif** | `font-serif` | Display / editorial | Page titles (h1), dialog headings, metric values, hero moments |
-| **JetBrains Mono** (variable) | `font-mono` | Data / code | IDs, timestamps, key prefixes, file paths, terminal output, metrics |
-| **Alice** | brand wordmark only | Brand wordmark | "Wrenn" in sidebar and login only — nowhere else |
-
-Instrument Serif at scale creates the signature editorial moments. Mono provides the precision signal for technical data. Never swap these roles.
-
-**Tracking overrides (app.css):**
- `.font-serif` — `letter-spacing: 0.015em` (positive tracking; Instrument Serif reads less condensed at display sizes)
- `.font-mono` — `font-variant-numeric: tabular-nums` (numbers align in tables and metric displays)
-
-**Type scale (root: 87.5% = 14px base):**
-| Token | Value | Use |
-|---|---|---|
-| `--text-display` | 2.571rem (~36px) | Auth section headings |
-| `--text-page` | 2rem (~28px) | Page h1 titles |
-| `--text-heading` | 1.429rem (~20px) | Dialog headings, empty states |
-| `--text-body` | 1rem (~14px) | Primary body, buttons, inputs |
-| `--text-ui` | 0.929rem (~13px) | Nav labels, table cells |
-| `--text-meta` | 0.857rem (~12px) | Key prefixes, minor info |
-| `--text-label` | 0.786rem (~11px) | Uppercase section labels |
-| `--text-badge` | 0.714rem (~10px) | Live badges, tiny indicators |
-
-### Color System
-
-All values are CSS custom properties in `frontend/src/app.css`.
-
-**Backgrounds (6-step near-black-green scale):**
-| Token | Value | Use |
-|---|---|---|
-| `--color-bg-0` | `#0a0c0b` | Page base, sidebar deepest layer |
-| `--color-bg-1` | `#0f1211` | Sidebar surface |
-| `--color-bg-2` | `#141817` | Card backgrounds |
-| `--color-bg-3` | `#1a1e1c` | Table headers, elevated surfaces |
-| `--color-bg-4` | `#212624` | Hover states, inputs |
-| `--color-bg-5` | `#2a302d` | Highlighted items, selected rows |
-
-**Text (5-level hierarchy):**
-| Token | Value | Use |
-|---|---|---|
-| `--color-text-bright` | `#eae7e2` | H1s, dialog headings |
-| `--color-text-primary` | `#d0cdc6` | Body copy, primary labels |
-| `--color-text-secondary` | `#9b9790` | Secondary labels, descriptions |
-| `--color-text-tertiary` | `#6b6862` | Hints, placeholders |
-| `--color-text-muted` | `#454340` | Dividers as text, ultra-subtle |
-
-**Accent (sage green — use sparingly, must feel earned):**
-| Token | Value | Use |
-|---|---|---|
-| `--color-accent` | `#5e8c58` | Primary CTA, live indicators, focus rings, active nav |
-| `--color-accent-mid` | `#89a785` | Hover accent text |
-| `--color-accent-bright` | `#a4c89f` | Accent on dark backgrounds |
-| `--color-accent-glow` | `rgba(94,140,88,0.07)` | Subtle tinted backgrounds |
-| `--color-accent-glow-mid` | `rgba(94,140,88,0.14)` | Hover tint on accent items |
-
-**Status semantics:**
-| Token | Value | Use |
-|---|---|---|
-| `--color-amber` | `#d4a73c` | Warning, paused state |
-| `--color-red` | `#cf8172` | Error, destructive actions |
-| `--color-blue` | `#5a9fd4` | Info, neutral system states |
-
-**Borders:** `--color-border` (`#1f2321`) default; `--color-border-mid` (`#2a2f2c`) for inputs/hover.
-
-### Component Patterns
-
-**Buttons:**
- Primary: solid sage green (`--color-accent`), hover brightness boost + micro-lift (`-translate-y-px`)
- Secondary: bordered (`--color-border-mid`), text transitions to accent on hover
- Danger: red text + subtle red background on hover
- All: `transition-all duration-150`
-
-**Inputs:**
- Border `--color-border`, background `--color-bg-2`; focus transitions border and icon to accent
- Group focus pattern: `group` wrapper + `group-focus-within:text-[var(--color-accent)]` on icon
-
-**Tables / data lists:**
- Grid layout; header `bg-3` + uppercase `--text-label`; row hover `hover:bg-[var(--color-bg-3)]`
- Status stripe: left border color matches sandbox state
-
-**Status indicators:** Running = animated ping + sage green dot; Paused = amber dot; Stopped = muted gray. Color is never the sole differentiator.
-
-**Modals & dialogs:** Border + shadow only — no accent gradient bars/strips. `fadeUp` 0.35s entrance.
-
-**Empty states:** Large icon with glow, Instrument Serif heading, secondary body text, CTA below, `iconFloat` 4s animation.
-
-**Animations (always respect `prefers-reduced-motion`):** `fadeUp` (entrance), `status-ping` (live indicator), `iconFloat` (empty states), `spin-once` (refresh), staggered `animation-delay` on lists.
-
-### Design Principles
-
-1. **Precision over friendliness.** Every element earns its place. Wrenn doesn't need to tell you it's developer-friendly — that should be self-evident from the quality of the information architecture.
-
-2. **Density with breathing room.** Data-forward doesn't mean cramped. Strategic whitespace creates calm hierarchy within dense contexts. Sections breathe; rows don't waste space.
-
-3. **Industrial warmth.** The serif + mono + warm-black combination prevents sterility. This is a forge, not a gallery. The warmth is in the details, not the primary colors.
-
-4. **Legible at speed.** Users scan dashboards in seconds. Strong typographic contrast (serif h1, mono IDs, sans body), consistent patterns, and predictable placement let users orientate instantly without reading everything.
-
-5. **Craft signals trust.** For infrastructure that runs production code, the quality of the UI is a proxy for the quality of the product. Pixel-level decisions matter. Polish is not decoration — it's a trust signal.
-
-<!-- code-review-graph MCP tools -->
-## MCP Tools: code-review-graph
-
-**IMPORTANT: This project has a knowledge graph. ALWAYS use the
-code-review-graph MCP tools BEFORE using Grep/Glob/Read to explore
-the codebase.** The graph is faster, cheaper (fewer tokens), and gives
-you structural context (callers, dependents, test coverage) that file
-scanning cannot.
-
-### When to use graph tools FIRST
-
- **Exploring code**: `semantic_search_nodes` or `query_graph` instead of Grep
- **Understanding impact**: `get_impact_radius` instead of manually tracing imports
- **Code review**: `detect_changes` + `get_review_context` instead of reading entire files
- **Finding relationships**: `query_graph` with callers_of/callees_of/imports_of/tests_for
- **Architecture questions**: `get_architecture_overview` + `list_communities`
-
-Fall back to Grep/Glob/Read **only** when the graph doesn't cover what you need.
-
-### Key Tools
-
-| Tool | Use when |
-|------|----------|
-| `detect_changes` | Reviewing code changes — gives risk-scored analysis |
-| `get_review_context` | Need source snippets for review — token-efficient |
-| `get_impact_radius` | Understanding blast radius of a change |
-| `get_affected_flows` | Finding which execution paths are impacted |
-| `query_graph` | Tracing callers, callees, imports, tests, dependencies |
-| `semantic_search_nodes` | Finding functions/classes by name or keyword |
-| `get_architecture_overview` | Understanding high-level codebase structure |
-| `refactor_tool` | Planning renames, finding dead code |
-
-### Workflow
-
-1. The graph auto-updates on file changes (via hooks).
-2. Use `detect_changes` for code review.
-3. Use `get_affected_flows` to understand impact.
-4. Use `query_graph` pattern="tests_for" to check coverage.
--- a/pyproject.toml
+++ b/pyproject.toml
@ -1,6 +1,6 @@
 [project]
 name = "wrenn"
-version = "0.1.2"
+version = "0.1.1"
 description = "Python SDK for Wrenn"
 readme = "README.md"
 license = "MIT"
Author	SHA1	Message	Date
Tasnim Kabir Sadik	06b4a8cbcb	Merge issues fixed All checks were successful ci/woodpecker/pr/check Pipeline was successful Details	2026-05-02 21:46:16 +06:00
Tasnim Kabir Sadik	04e5dc652f	Fix error handling, resource leaks, and logic bugs across the SDK Bugs fixed: - files.py: use typed error checking (_raise_for_status) instead of raw raise_for_status(), ensuring WrennNotFoundError etc. are raised correctly - exceptions.py: check both "capsule_ids" and "sandbox_ids" response keys for backwards compatibility - code_interpreter: retry _ensure_kernel on 5xx errors (only fail on 4xx), remove redundant TimeoutError in bare except, clean up non-standard top-level msg_id/msg_type from Jupyter messages Resource leaks fixed: - capsule.py: close WrennClient if capsule creation or init fails - code_interpreter: add close()/__del__ for _proxy_client cleanup when not using context manager Logic fixes: - pty.py: yield exit events to callers instead of silently discarding them - capsule.py: auto-resume paused capsules in wait_ready instead of failing - capsule.py: log warnings on destroy failure in __exit__ instead of silently swallowing errors	2026-05-02 21:34:02 +06:00
Tasnim Kabir Sadik	4a7db8e204	fix: set httpx read timeout for long-running commands and handle non-JSON error responses - Set per-request httpx timeout (command timeout + 10s buffer) in Commands.run() and AsyncCommands.run() for foreground exec calls, preventing HTTP read timeouts on long-running commands - Raise WrennInternalError instead of raw httpx.HTTPStatusError when handle_response() encounters a non-JSON error body (e.g. 502 from a reverse proxy)	2026-05-02 19:02:39 +06:00
pptx704	aa9477ffe8	Added doc generator for SDK All checks were successful ci/woodpecker/push/check Pipeline was successful Details	2026-04-24 00:01:20 +06:00
pptx704	2bb3dbd71d	Merge branch 'main' of git.omukk.dev:wrenn/python-sdk into dev	2026-04-23 23:53:15 +06:00
Rafeed M. Bhuiyan	3f26a2fbcf	Merge branch 'main' into dev Some checks failed ci/woodpecker/push/check Pipeline was canceled Details	2026-04-23 12:38:41 +00:00
pptx704	2faf0dd0ae	Updated woodpecker config All checks were successful ci/woodpecker/push/check Pipeline was successful Details	2026-04-23 18:36:35 +06:00
pptx704	68c7d0de42	ci: add test pipeline, PyPI release workflow, and lint fixes - Update Woodpecker to run unit and integration tests in parallel - Add GitHub Actions workflow for PyPI trusted publishing on main - Add license, classifiers, keywords, and URLs to pyproject.toml - Fix ruff lint errors (unused imports, duplicate class name) and formatting	2026-04-23 18:32:59 +06:00
Rafeed M. Bhuiyan	ad64c85393	Merge pull request 'Feat: Added git support' (#5 ) from feat/git-support into dev Some checks failed ci/woodpecker/push/check Pipeline failed Details Reviewed-on: #5	2026-04-22 23:45:36 +00:00
pptx704	bab53aedbe	Updated readme	2026-04-23 05:44:49 +06:00
pptx704	82e181dd7e	test: add integration tests for capsule lifecycle, commands, files, and git 43 tests across 4 classes hitting the live API. Shared capsule per class to minimize VM boot overhead. All capsules destroyed in teardown. Skips automatically when WRENN_API_KEY is not available.	2026-04-23 05:40:06 +06:00
pptx704	ee1f55635f	fix: wrap commands in /bin/sh -c for proper server-side argv expansion The server-side agent runs commands through a nice wrapper that uses "${@}" expansion. Sending the full command string as a single cmd field caused nice to treat it as one executable name. Now Commands.run sends cmd=/bin/sh args=["-c", cmd_string] so "${@}" expands into proper argv.	2026-04-23 05:16:08 +06:00
pptx704	6bdf28e2ae	Added git integration	2026-04-23 04:46:57 +06:00
pptx704	61bc040098	Minor patches Some checks failed ci/woodpecker/push/check Pipeline failed Details	2026-04-23 02:31:47 +06:00
pptx704	7b35ffb60c	docs: add Google-style docstrings to all public SDK methods Some checks failed ci/woodpecker/push/check Pipeline failed Details	2026-04-17 04:29:34 +06:00
pptx704	42bcc792d6	Updated dependency Some checks failed ci/woodpecker/push/check Pipeline failed Details	2026-04-17 03:29:45 +06:00
pptx704	3f97c73b2f	feat: redesign code interpreter with structured Execution model Some checks failed ci/woodpecker/push/check Pipeline failed Details Replace flat CodeResult with a proper model hierarchy: Execution (top-level), Result (per-output with typed MIME fields), Logs (stdout/stderr as lists), and ExecutionError (structured name/value/traceback). Handle display_data messages for rich output, add streaming callbacks (on_result, on_stdout, on_stderr, on_error), and remove the misleading stdout-to-text fallback. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 03:16:39 +06:00
Rafeed M. Bhuiyan	7e7ecbd48a	Merge pull request 'feat: implement client architecture and sandbox environment' (#3 ) from feat/client-and-sandbox-support into dev Some checks failed ci/woodpecker/push/check Pipeline failed Details Reviewed-on: #3	2026-04-15 15:35:40 +00:00
pptx704	7b9a06d1b5	chore: add python-dotenv dependency Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 21:33:53 +06:00
pptx704	3d0eda5c60	feat: rename kill to destroy, improve code interpreter, update README - Rename Capsule.kill/AsyncCapsule.kill to destroy for frontend consistency - Add Sandbox deprecation alias to wrenn.code_interpreter module - run_code text falls back to stripped stdout when no expression result - Strip quotes from string expression results (matching e2b behavior) - _ensure_kernel reuses existing Jupyter kernels before creating new ones - Rewrite README with complete examples for capsules and code interpreter - Remove stale AGENTS.md Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 18:58:59 +06:00
pptx704	eecf1dc65b	chore: update OpenAPI schema, generated models, and build config Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 15:31:07 +06:00
pptx704	3cced768a4	feat: redesign SDK with e2b-compatible interface Replace the WrennClient-centric API with a top-level Capsule class that mirrors e2b's Sandbox interface, enabling drop-in migration. Key changes: - Capsule/AsyncCapsule with direct construction (reads WRENN_API_KEY and WRENN_BASE_URL env vars), namespaced sub-objects (capsule.commands, capsule.files), dual instance/static lifecycle methods via _DualMethod descriptor (capsule.kill() and Capsule.kill(id)) - WrennClient simplified to API-key-only endpoints (capsules, snapshots); JWT-based resources (auth, hosts, teams) removed - wrenn.code_interpreter submodule with Capsule subclass defaulting to code-runner-beta template and run_code() support - Sandbox alias emits FutureWarning instead of DeprecationWarning Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 15:19:23 +06:00
Tasnim Kabir Sadik	0ac9bf79ee	feat: created README	2026-04-13 03:16:44 +06:00
Tasnim Kabir Sadik	bf5914c0a8	fix: renamed sandbox to capsule	2026-04-13 03:16:27 +06:00
Tasnim Kabir Sadik	976af9a209	ci: woodpecker doesn't support variable expansions outside of commands	2026-04-12 03:08:34 +06:00
Tasnim Kabir Sadik	f3fd6865f9	ci: bug fixes	2026-04-12 03:03:33 +06:00
Tasnim Kabir Sadik	340ed46df6	CI for linting and testing	2026-04-12 02:51:14 +06:00
Tasnim Kabir Sadik	a5bf66c199	feat: add sandbox filesystem and terminal support Add sandbox filesystem methods (list_dir, mkdir, remove, upload, download, stream_upload, stream_download) and interactive PTY sessions (PtySession, AsyncPtySession) with reconnect support per FILE_TERMINAL.md spec. Refactor error handling into exceptions.py as shared handle_response(). Replace API-key-only proxy auth with unified _proxy_headers() supporting both API key and JWT. Fix stream_upload to build multipart manually instead of relying on httpx files= with generators. Switch Makefile SPEC_URL from main to dev branch. Regenerate models from updated OpenAPI spec (adds teams, channels, metrics, PTY endpoints). Add comprehensive unit and integration tests. Trim AGENTS.md to verified facts only.	2026-04-12 02:35:20 +06:00
Tasnim Kabir Sadik	f51a962fff	feat: implement client architecture and sandbox environment Introduces the core Wrenn client and a dedicated sandbox execution environment. This includes automated model generation and a custom exception hierarchy to support robust integration. - Add `WrennClient` in `src/wrenn/client.py` for API interaction. - Implement `Sandbox` in `src/wrenn/sandbox.py` for isolated execution. - Add Pydantic/model support via `_generated.py`. - Define project-specific error types in `exceptions.py`. - Include AGENTS.md documentation for specialized logic. - Add comprehensive unit and integration tests. - Update build system (Makefile, uv.lock, pyproject.toml) and LICENSE.	2026-04-10 22:24:50 +06:00