Implementing "jsonMode" for LLM calls, and from the react app iframe calls #965

jakedahn · 2025-04-01T22:22:03Z

This PR introduces jsonMode capability to the toolshed llm api endpoints.

Unfortunately, each language model provider has a different strategy for allowing the return of json data. We primarily use Claude and Groq at the moment, claude only supports prompting (and they provide some instructions in their docs for how to do this better), and in groq you must prompt but can also require that the response is valid json.

So this PR introduces some special casing prompt additions to various models when jsonMode: true is set as an api request parameter.

I also removed a handful of never-used language models to clean up some dependencies and surfaces area we need to support.

jakedahn · 2025-04-01T22:25:03Z

This should fix #965

bfollington

Not a big deal, but a couple comments.

You can merge without my re-review.

bfollington · 2025-04-02T07:27:09Z

charm/src/iframe/static.ts

consider: mode: 'json'? seems more common to me

bfollington · 2025-04-02T07:29:13Z

toolshed/routes/ai/llm/generateText.ts

Thoughts on adding tests for this? It's got a bunch of specific edge cases so tests would act as documentation of expected behavior etc.

Might be generateable?

jsantell · 2025-04-02T18:12:19Z

toolshed/routes/ai/llm/generateText.test.ts

The Deno.test/t.step interface was used in basic-flow to handle some integration/astral slowness/assertions during development, but could probably be ported back to the BDD style -- if we don't have an explicit reason to use the inner t.step interface, consider the BDD wrapper around stepped tests e.g. import { describe, it, beforeEach } from "@std/testing/bdd"

jsantell · 2025-04-02T18:17:08Z

toolshed/deno.json

👍 slimmer deps!

anotherjesse · 2025-04-03T13:38:08Z

toolshed/routes/ai/llm/llm.handlers.ts

is there a reason we want to skip caching for json requests?

anotherjesse · 2025-04-03T13:56:29Z

toolshed/routes/ai/llm/llm.handlers.ts

ahh, this should be the cacheKey earlier - don't recompute it!

calls

…uild-llm-cache.sh

* Now ensures processes get cleaned up on failure * Creates a temporary cache directory, so we can understand what the new cache files are, passes CACHE_DIR into toolshed * Copies cache files to the correct location

logic

jakedahn requested a review from bfollington April 1, 2025 22:22

jakedahn temporarily deployed to ci April 1, 2025 22:24 — with GitHub Actions Inactive

jakedahn had a problem deploying to ci April 1, 2025 22:25 — with GitHub Actions Failure

jakedahn requested a review from anotherjesse April 1, 2025 22:28

bfollington requested changes Apr 2, 2025

View reviewed changes

jakedahn temporarily deployed to ci April 2, 2025 15:10 — with GitHub Actions Inactive

jakedahn had a problem deploying to ci April 2, 2025 15:12 — with GitHub Actions Failure

jakedahn force-pushed the llm-jsonmode branch from 2464591 to 6e7076d Compare April 2, 2025 16:02

jakedahn temporarily deployed to ci April 2, 2025 16:04 — with GitHub Actions Inactive

jakedahn had a problem deploying to ci April 2, 2025 16:06 — with GitHub Actions Failure

jakedahn temporarily deployed to ci April 2, 2025 17:33 — with GitHub Actions Inactive

jakedahn had a problem deploying to ci April 2, 2025 17:35 — with GitHub Actions Failure

jakedahn force-pushed the llm-jsonmode branch from ca4e60f to b5934d6 Compare April 2, 2025 18:07

jakedahn temporarily deployed to ci April 2, 2025 18:09 — with GitHub Actions Inactive

jakedahn had a problem deploying to ci April 2, 2025 18:11 — with GitHub Actions Failure

jsantell reviewed Apr 2, 2025

View reviewed changes

jakedahn temporarily deployed to ci April 2, 2025 20:37 — with GitHub Actions Inactive

jakedahn had a problem deploying to ci April 2, 2025 20:39 — with GitHub Actions Failure

jakedahn had a problem deploying to ci April 3, 2025 13:17 — with GitHub Actions Failure

anotherjesse reviewed Apr 3, 2025

View reviewed changes

toolshed/routes/ai/llm/llm.handlers.ts Outdated

Copy link

Contributor

anotherjesse Apr 3, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is there a reason we want to skip caching for json requests?

anotherjesse reviewed Apr 3, 2025

View reviewed changes

toolshed/routes/ai/llm/llm.handlers.ts Outdated

Copy link

Contributor

anotherjesse Apr 3, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ahh, this should be the cacheKey earlier - don't recompute it!

jakedahn temporarily deployed to ci April 3, 2025 16:02 — with GitHub Actions Inactive

jakedahn had a problem deploying to ci April 3, 2025 16:04 — with GitHub Actions Failure

jakedahn temporarily deployed to ci April 3, 2025 16:16 — with GitHub Actions Inactive

jakedahn temporarily deployed to ci April 3, 2025 16:18 — with GitHub Actions Inactive

jakedahn added 4 commits April 3, 2025 12:33

Implementing "jsonMode" for LLM calls, and from the react app iframe

03750d1

calls

removing rarely used models and their dependencies

902b681

adding new cached llm responses

594b03b

Bumping llm timeout to 60s, which should only ever impact runs of reb…

374f2fb

…uild-llm-cache.sh

jakedahn added 14 commits April 3, 2025 12:33

improving rebuild-llm-cache.sh script

5ff3d13

* Now ensures processes get cleaned up on failure * Creates a temporary cache directory, so we can understand what the new cache files are, passes CACHE_DIR into toolshed * Copies cache files to the correct location

tweaking things to be mode: "json"

07cb4be

adding some tests

9c80a6e

updating cache

5e62e9e

updating cache

ce8b69a

idk

1d5d58c

attempting to fix

bff7ea8

cache redo

e571dbf

cache streaming responses

3942552

cache tweaks

9ace614

moar

b5ae245

progress?

523e0bc

Fixing duplicate message in cache artifacts, and cleaning up caching

45d2c25

logic

tweaking tests

c94dc9f

jakedahn force-pushed the llm-jsonmode branch from 49a9d27 to c94dc9f Compare April 3, 2025 18:33

jakedahn temporarily deployed to ci April 3, 2025 18:35 — with GitHub Actions Inactive

jakedahn temporarily deployed to ci April 3, 2025 18:37 — with GitHub Actions Inactive

jakedahn merged commit a7876f6 into main Apr 3, 2025
6 checks passed

jakedahn deleted the llm-jsonmode branch April 3, 2025 18:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implementing "jsonMode" for LLM calls, and from the react app iframe calls #965

Implementing "jsonMode" for LLM calls, and from the react app iframe calls #965

Uh oh!

jakedahn commented Apr 1, 2025

Uh oh!

jakedahn commented Apr 1, 2025

Uh oh!

bfollington left a comment

Uh oh!

bfollington Apr 2, 2025

Uh oh!

bfollington Apr 2, 2025

Uh oh!

jsantell Apr 2, 2025

Uh oh!

jsantell Apr 2, 2025

Uh oh!

anotherjesse Apr 3, 2025

Uh oh!

anotherjesse Apr 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Implementing "jsonMode" for LLM calls, and from the react app iframe calls #965

Implementing "jsonMode" for LLM calls, and from the react app iframe calls #965

Uh oh!

Conversation

jakedahn commented Apr 1, 2025

Uh oh!

jakedahn commented Apr 1, 2025

Uh oh!

bfollington left a comment

Choose a reason for hiding this comment

Uh oh!

bfollington Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

bfollington Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

jsantell Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

jsantell Apr 2, 2025

Choose a reason for hiding this comment

Uh oh!

anotherjesse Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

anotherjesse Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants