tavily icon

Why Tavily's Dynamic Filtering post answers its LinkedIn ads on substance, but loses the number-led hook in the H1

We scored 4 unique copy variants from a 4-ad LinkedIn cluster pointing to tavily.com's Dynamic Filtering blog post. The ads lead with concrete numbers: 12x cheaper than Anthropic's Programmatic Tool Calling, 77.9% F1 versus 50.8%, 3.5x fewer tokens, and a portable skill that runs in Claude Code, Cursor, and Codex. The post delivers all of that evidence, including a 900-question DeepSearchQA run on Claude Sonnet 4.6, but its H1 describes the mechanism instead of the outcome a click was sold on.

by PostClickSignal Editorial·first audited 2026-05-24·5 min read
01

Primary click path

// Ad

Tavily icon

Tavily

Promoted · LinkedIn ad sample 1

Anthropic's PTC costs $4.65 per query. Our open alternative delivers better results for $0.37.

12× cheaper AI search. No API lock-in.

1397534356

image

// Landing page

Dynamic Filtering: Let the Model Program Its Own Search Filters | Tavily Blog screenshot
https://tavily.com/blog/dynamic-filtering-let-the-model-program-its-own-search-filters
02

The score.

// Overall score

8.6
/ 10
Grade · B+
Headline match
7.5
Offer continuity
9.5
Visual + tone
8.5
Scent + intent
9
03

The verdict

Tavily's LinkedIn cluster sells outcomes: 12x cheaper than Anthropic's Programmatic Tool Calling, 77.9% F1 versus 50.8% on the same model, 3.5x fewer tokens, and a single skill file that works in any coding agent. Every one of those claims is restated, sourced, and visualized inside the destination blog post.

The friction is at the top of the page. The H1 reads 'Dynamic Filtering: Let the Model Program Its Own Search Filters,' which names the mechanism instead of the result the ads were sold on. A visitor who clicked the cost-and-accuracy hook has to scroll past the abstract before the matching numbers appear. The audit lands at B+ because the substance is unusually strong but the hero copy does not echo the dominant ad promise.

04

The ads pointing here

// Ad cluster

4

LinkedIn copy variants scored.

Scored sample: 4 ads.

Learn more

// Dominant headline

12x cheaper AI search. No API lock-in.
Cheaper than Anthropic PTCOpen and portable across any coding agentHigher F1 accuracy on the same benchmarkFewer tokens, leaner context

The LinkedIn Ad Library shows 4 unique copy variants pointing to the Dynamic Filtering post, all running the same 'Learn more' CTA. They split cleanly across the four claims in the article: cost, portability, accuracy, and token efficiency.

Variant 1 leads with '12x cheaper AI search. No API lock-in.' and the body line 'Anthropic's PTC costs $4.65 per query. Our open alternative delivers better results for $0.37.' Variant 2 reframes the same offer as portability: 'One skill file. Claude Code, Cursor, Codex - any agent.' Variant 3 leads with accuracy: '77.9% vs 50.8% F1. Same model. Open source.' Variant 4 leads with token efficiency: '3.5x fewer tokens. 53% better accuracy. Here's how.'

All four variants point to the same article URL with LinkedIn campaign UTMs, which means the ads are deliberately A/B testing which outcome hook moves a technical audience to read the post.

// Ads scored

More ad variants.

Tavily icon

Tavily

Promoted · LinkedIn ad sample 2

Anthropic's dynamic filtering only works inside their API. We rebuilt it to run anywhere.

One skill file. Claude Code, Cursor, Codex — any agent.

1397436016

image
Tavily icon

Tavily

Promoted · LinkedIn ad sample 3

We ran Anthropic's own benchmark against their dynamic filtering. Then we beat it by 27 points.

77.9% vs 50.8% F1. Same model. Open source.

1397494306

image
Tavily icon

Tavily

Promoted · LinkedIn ad sample 4

Raw search results flooding your context window degrade your agent's reasoning. There's a fix.

3.5× fewer tokens. 53% better accuracy. Here's how.

1397524366

image
05

What the page promises

The post is a 4-minute engineering write-up that introduces the principle of dynamic filtering, explains how Anthropic's Programmatic Tool Calling implements it, and then shows how Tavily rebuilt the same pattern as an open skill on the Tavily CLI that works in any harness with Bash and Python.

The benchmark section delivers the numbers the ads sell. On the full 900-question DeepSearchQA set the skill approach scores 72.7% F1 against Anthropic's reported 59.4% on the same Claude Sonnet 4.6 model. On a 50-question head-to-head run the skill scores 77.9% F1 versus 50.8% for PTC, averages 413K tokens per query at $0.37 against PTC's 1.46M tokens at $4.65, and lands at roughly 3.5x fewer tokens and 12x cheaper.

The article then explains why the pattern works - variable space stays in the execution environment, only print() output crosses into context - and closes with a 'Replicating it with the Tavily CLI' section and a 'Get started' call to action, which is what the ads' Learn more CTA is funding.

06

Dimension breakdown

Headline match
7.5

The ads lead with cost and accuracy numbers, but the H1 describes a mechanism. The matching numbers are in the article, just not in the hero.

Offer continuity
9.5

The cost, accuracy, token, and portability claims from the four ads are all restated and sourced in the post, with charts and a methodology section.

Visual tone match
8.5

LinkedIn ads aimed at AI engineering buyers landing on a benchmark-heavy engineering blog. Format and audience read are aligned.

Scent intent
9

Within the first viewport the abstract names Anthropic's PTC and the headline F1 numbers, confirming the scent of every ad variant quickly.

07

Top fixes

01

Mirror the ad hooks in the H1

The ads sell outcomes; the H1 sells a mechanism. Putting the headline numbers in the hero closes the gap between the click promise and the first thing the reader sees.

Current

Dynamic Filtering: Let the Model Program Its Own Search Filters

Rewrite

Dynamic Filtering: 12x cheaper, 3.5x fewer tokens, and 27 F1 points over Anthropic's PTC.

02

Add a clearer primary CTA near the top of the post

The ad CTA is 'Learn more,' but the campaign goal is adoption of the Tavily CLI and the dynamic-filtering skill. A 'Get the skill' or 'Try the Tavily CLI' button above the methodology section converts the read into the action the spend is funding.

Current

Get started (only at the end of the article)

Rewrite

Get the dynamic-filtering skill (in the hero, repeated at the end)

03

Surface a head-to-head summary card

The sharpest hook in the cluster is the 77.9% vs 50.8% F1 line. Pulling those numbers into a summary card under the title lets readers confirm the scent before they read the F1 explainer.

Current

Numbers appear only after the methodology and benchmark sections.

Rewrite

77.9% F1 vs 50.8% F1 on DeepSearchQA (N=50), same model, open source.

08

Rewrite preview

// Suggested hero

Dynamic Filtering: 12x cheaper, 27 F1 points better than Anthropic's PTC

We rebuilt Anthropic's Programmatic Tool Calling as a portable skill on the Tavily CLI. Same DeepSearchQA benchmark, same Claude Sonnet 4.6, $0.37 per query instead of $4.65.

09

FAQ

How many ads does Tavily run to this page?

The LinkedIn Ad Library shows 4 ads pointing to the Dynamic Filtering blog post, all with the 'Learn more' CTA and the same UTM-tagged destination. The 4 ads are 4 unique copy variants, each leading with a different outcome from the article.

Do the ad claims match what the page actually shows?

Yes. The cost claim ($0.37 vs $4.65, ~12x cheaper), the accuracy claim (77.9% F1 vs 50.8% F1 on the head-to-head, 72.7% F1 vs 59.4% on the full 900-question set), the token claim (~3.5x fewer tokens), and the portability claim (skill file runs in Claude Code, Cursor, Codex, pi) are all stated and sourced in the post.

Why did the page lose points if every claim is backed up?

The H1 describes the mechanism, not the outcome. The ads lead with cost and accuracy numbers, but the hero reads 'Dynamic Filtering: Let the Model Program Its Own Search Filters,' which is the engineering name for the pattern. A reader who clicked the cost or accuracy hook has to scroll before the matching numbers appear.

What single change would move this audit closest to A?

Rewriting the H1 around the headline numbers - cost, tokens, and F1 - so the first thing a visitor reads echoes the ad they clicked. Continuity is already at 9.5, so closing the headline gap is what unlocks the higher grade.

10

Sources

  • LinkedIn Ad Library: 4 unique copy variants from 4 ads pointing to /blog/dynamic-filtering-let-the-model-program-its-own-search-filters
  • Landing page: https://tavily.com/blog/dynamic-filtering-let-the-model-program-its-own-search-filters
  • Advertiser homepage: https://tavily.com

Want to see where your paid clicks drift?

PostClickSignal grades how well your ad cluster matches the landing page it sends traffic to, then writes the fixes for you.

Audit my ads