    Replace thinking with native Gemini thinking summaries (#400) · 30b5c0d0
    Will Chen committed
    This uses Gemini's native [thinking
    summaries](https://cloud.google.com/vertex-ai/generative-ai/docs/thinking#thought-summaries),
    which were recently added to the API.
    
    Why? The grafted thinking sometimes confused the model, especially
    Gemini 2.5 Flash, which would put dyad tags like `<dyad-write>` inside
    the `<think>` tags.
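
To make the failure mode concrete, here is a minimal sketch of how such a misplaced tag could be detected in a response. The helper `findDyadTagsInsideThink` is hypothetical and not part of this commit; it only assumes that thinking was wrapped in literal `<think>...</think>` tags and that dyad tags start with `dyad-`.

```typescript
// Hypothetical helper (not from the Dyad codebase): returns the names of
// dyad tags that were opened inside <think>...</think> blocks.
function findDyadTagsInsideThink(response: string): string[] {
  const found: string[] = [];
  // Match each <think> block non-greedily, including across newlines.
  const thinkBlock = /<think>([\s\S]*?)<\/think>/g;
  let block: RegExpExecArray | null;
  while ((block = thinkBlock.exec(response)) !== null) {
    // Match opening tags such as <dyad-write path="...">; closing tags
    // start with "</" and are therefore skipped.
    const dyadTag = /<(dyad-[a-z-]+)\b[^>]*>/g;
    let tag: RegExpExecArray | null;
    while ((tag = dyadTag.exec(block[1])) !== null) {
      found.push(tag[1]);
    }
  }
  return found;
}
```

A response whose dyad tags all appear after the closing `</think>` yields an empty array, so a non-empty result flags the confusion described above.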
    
    This also improves the UX because you can see the native thoughts rather
    than having the Gemini response load for a while without any feedback.
    
    I tried adding Anthropic extended thinking, but it requires the
    temperature to be set to 1, which isn't ideal for Dyad's use case, where
    we need precise syntax following.
context_manage.spec.ts_manage-context---smart-context-2.txt 13.7 KB