Jina AI
Jina AI
Tool
deduplicate_strings
Get top-k semantically unique strings from a list using Jina embeddings and submodular optimization. Use this when you have many similar strings and want to select the most diverse subset that covers the semantic space. Perfect for removing duplicates, selecting representative samples, or finding diverse content.
Pricing
Per call
$0.01
Model
flat
Pay only for what you use. No subscriptions.
Inputs
strings *
arrayk
numberTry It
API
MCP Config
Input Parameters
Array of strings to deduplicate
Number of unique strings to return. If not provided, automatically finds optimal k by looking at diminishing return

