Cloud Wars

Pinecone Tackles Threat Detection, ‘Extreme Classification’ with New Vector Database

By John Foley | June 17, 2021 | Updated: December 13, 2021 | 5 Mins Read

Pinecone Systems is demonstrating how cloud vector databases offer a fast, scalable way to build business-critical capabilities powered by machine learning, such as IT threat detection and complex classification and recommendation over big data.

Pinecone has released templates to help developers and technical teams build and deploy applications that address four common scenarios with the scale and performance advantages of a vector database. And Pinecone is providing a benchmark guide that can be used to assess the vector database’s performance using an organization’s own data sets.

The use cases are an important step forward because vector databases are relatively new in the world of cloud databases. A vector database stores, searches, and retrieves vectors, which are long strings of numbers representing documents, images, and other data types used in machine learning applications. They can be used for recommendations, personalization, image search, and more.
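
To make the idea of storing, searching, and retrieving vectors concrete, here is a minimal sketch in plain Python. It is not Pinecone's API: the `TinyVectorIndex` class, its `upsert` and `query` methods, and the three-dimensional example vectors are all hypothetical names and values chosen for illustration, and the search is a simple brute-force cosine comparison rather than anything a production service would use.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class TinyVectorIndex:
    """Hypothetical in-memory index: stores vectors by id and ranks them by similarity."""

    def __init__(self):
        self._vectors = {}

    def upsert(self, item_id, vector):
        # Insert a new vector or overwrite the one already stored under item_id.
        self._vectors[item_id] = list(vector)

    def query(self, vector, top_k=3):
        # Brute force: score every stored vector, then keep the top_k best matches.
        scored = [
            (cosine_similarity(vector, stored), item_id)
            for item_id, stored in self._vectors.items()
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [(item_id, score) for score, item_id in scored[:top_k]]


# Made-up embeddings standing in for documents.
index = TinyVectorIndex()
index.upsert("doc-sports", [0.9, 0.1, 0.0])
index.upsert("doc-finance", [0.1, 0.9, 0.1])
index.upsert("doc-weather", [0.0, 0.2, 0.9])

results = index.query([0.8, 0.2, 0.0], top_k=2)
for item_id, score in results:
    print(item_id, round(score, 3))
```

In this toy data the query vector points in nearly the same direction as `doc-sports`, so that item ranks first. Real embeddings typically have hundreds of dimensions and come from a trained model.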

Pinecone’s vector database, which is in beta availability, provides similarity search as a service on AWS and Google Cloud. Pinecone has published templates to help users get started with four scenarios:

  • IT threat detection. Pinecone shows how to build a network intrusion detector using deep learning and similarity search. By checking the similarity of incoming threats with known attacks, the database is able to detect “rare” events that may represent a potential threat.
  • Semantic text search. Pinecone outlines how to create a semantic text search capability for online news articles using short, simple queries. To do it, vector representations of the articles are stored in the database index.
  • Extreme classification. The idea is to label new items automatically when the number of possible labels is “enormous” or extreme, such as matching web content to relevant advertisements. In the example provided, 250,000 labels are converted into vector embeddings.
  • Video recommendations. The challenge here is to provide movie recommendations based on similar user ratings (on a scale of 1 to 5). It is complicated by the fact that the ratings are sparse relative to all movies, and biased because users distribute their ratings differently. The solution involves a dataset of movie recommendations, deep learning models for both movies and users, and a deep ranking model to score user/movie pairings for improved relevance of recommendations.
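
The threat-detection scenario in the list above relies on comparing a new event with known examples. A rough sketch of that idea follows, assuming each network event has already been converted to a vector by some model. The function names, the two-dimensional vectors, and the `"attack"`/`"benign"` labels are hypothetical, and this nearest-neighbor vote is a stand-in for illustration, not the method used in Pinecone's template.

```python
import math


def k_nearest(query, labeled_vectors, k):
    """Return the k (distance, label) pairs closest to query by Euclidean distance."""
    ranked = sorted((math.dist(query, vec), label) for vec, label in labeled_vectors)
    return ranked[:k]


def attack_score(event, labeled_vectors, k=3):
    """Fraction of the k nearest known events that are labeled as attacks."""
    neighbors = k_nearest(event, labeled_vectors, k)
    if not neighbors:
        return 0.0
    attacks = sum(1 for _, label in neighbors if label == "attack")
    return attacks / len(neighbors)


# Made-up embeddings of previously seen events with known labels.
known_events = [
    ([0.10, 0.20], "benign"),
    ([0.20, 0.10], "benign"),
    ([0.15, 0.15], "benign"),
    ([0.90, 0.80], "attack"),
    ([0.80, 0.90], "attack"),
    ([0.85, 0.95], "attack"),
]

suspicious = attack_score([0.88, 0.86], known_events)
normal = attack_score([0.12, 0.18], known_events)
print("suspicious event score:", suspicious)
print("normal event score:", normal)
```

The same nearest-neighbor lookup is the basic building block behind the other scenarios as well: semantic search ranks article vectors against a query vector, and extreme classification can pick the label whose embedding sits closest to a new item.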

Billions of vectors

In addition to those use cases, Pinecone has introduced a benchmarking guide for testing the performance and accuracy of its similarity search using an organization’s own data. The tutorial addresses how to measure indexing runtime, query runtime, and other metrics, for both exact and approximated searches.
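
As an illustration of what such a benchmark measures, the sketch below times index construction and queries for an exact brute-force search and for a crude approximate search, then reports how many of the exact results the approximate search recovers (recall). Everything here is an assumption made for the example: the data is random, the approximate method is a simple random-hyperplane hash written from scratch, and none of the function names come from Pinecone's guide.

```python
import math
import random
import time


def make_unit_vectors(n, dim, rng):
    """Generate n random vectors of length 1, so dot product equals cosine similarity."""
    vectors = []
    for _ in range(n):
        raw = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        norm = math.sqrt(sum(x * x for x in raw))
        vectors.append([x / norm for x in raw])
    return vectors


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def exact_search(query, vectors, top_k):
    """Brute force: rank every stored vector against the query."""
    order = sorted(range(len(vectors)), key=lambda i: dot(query, vectors[i]), reverse=True)
    return order[:top_k]


def signature(vec, planes):
    """Hash a vector by which side of each random hyperplane it falls on."""
    return tuple(dot(vec, plane) >= 0.0 for plane in planes)


def build_hash_index(vectors, planes):
    buckets = {}
    for i, vec in enumerate(vectors):
        buckets.setdefault(signature(vec, planes), []).append(i)
    return buckets


def approx_search(query, vectors, buckets, planes, top_k):
    """Approximate: rank only the vectors that share the query's hash bucket."""
    candidates = buckets.get(signature(query, planes), [])
    ranked = sorted(candidates, key=lambda i: dot(query, vectors[i]), reverse=True)
    return ranked[:top_k]


def run_benchmark(n=2000, dim=16, n_queries=20, top_k=5, n_planes=4, seed=0):
    rng = random.Random(seed)
    vectors = make_unit_vectors(n, dim, rng)
    queries = make_unit_vectors(n_queries, dim, rng)
    planes = make_unit_vectors(n_planes, dim, rng)

    start = time.perf_counter()
    buckets = build_hash_index(vectors, planes)
    index_seconds = time.perf_counter() - start

    start = time.perf_counter()
    exact = [exact_search(q, vectors, top_k) for q in queries]
    exact_seconds = time.perf_counter() - start

    start = time.perf_counter()
    approx = [approx_search(q, vectors, buckets, planes, top_k) for q in queries]
    approx_seconds = time.perf_counter() - start

    # Recall: share of the exact top_k results that the approximate search also returned.
    hits = sum(len(set(e) & set(a)) for e, a in zip(exact, approx))
    return {
        "index_seconds": index_seconds,
        "exact_query_seconds": exact_seconds,
        "approx_query_seconds": approx_seconds,
        "recall": hits / (n_queries * top_k),
    }


report = run_benchmark()
for name, value in report.items():
    print(f"{name}: {value:.4f}")
```

The approximate search examines only one bucket per query, so it usually runs faster but misses some true neighbors. Swapping the random vectors for an organization's own embeddings is what turns a sketch like this into a meaningful comparison.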

And finally, Pinecone is providing early access to an upcoming capability called Managed FAISS. An acronym for Facebook AI Similarity Search, FAISS is a library that developers use to find similar embeddings of multimedia documents. With FAISS as a managed cloud service, Pinecone aims to scale to billions of vectors without the operational complexity of a self-hosted approach.

Bigger, better, faster

The application templates and other new developments are signs that Pinecone’s vector database is maturing, and they are prerequisites for Pinecone’s general availability as a cloud service.

When I talked to Pinecone founder and CEO Edo Liberty recently, he said the company is focused on the “production readiness” of its platform. By the end of this year, he said, the Pinecone vector database will be “much more capable, bigger, better, faster, and easier to use.”

Listen to the podcast: Pinecone Systems CEO Edo Liberty: The Cloud Database Report Podcast

Pinecone, a startup, exited stealth mode in January with $10 million in seed funding from Wing Venture Capital, an early-stage investor in Snowflake. That has prompted comparisons to Snowflake’s cloud database platform model.

Liberty says businesspeople are interested in understanding how machine learning can be applied to meet their own business objectives. “They go to their chief scientist or CTO and say, ‘Why don’t we do that?’”

The new use cases and real-world benchmarks should help elevate those conversations from the arcane technical details of vector databases to business solutions and opportunities.

 

This news analysis is provided by the Cloud Database Report. As the pace of change in data management accelerates, business and IT decision makers are keenly aware that “big data” represents tremendous value if they are able to capitalize on it. Cloud databases are increasingly perceived as a faster, better, cheaper way to gain insights and drive innovation.

John Foley

John is founder of the Cloud Database Report and host/Sr. Analyst for the Data Revolution channel on Acceleration Economy. For more than 20 years, John has covered database management systems and data warehouses, and the ongoing challenges businesses face with data quality, policy, performance, and scale. He also writes and podcasts regularly about the latest trends and innovations in cloud database platforms, including data integration, analytics, machine learning, data transformation, autonomous management, and hybrid clouds. John digs into real-world use cases and best practices that lead to data-driven insights and actions. Recently he helped drive strategy as a communications leader at Oracle, IBM, and MongoDB. That first-hand industry experience informs his perspective and analysis.

© 2025 Cloud Wars.
