Classifying sensitive unstructured data like source codes or employee contracts will now be possible with the help of Sentra’s classification engine, which will now utilize large language models (LLM).
Cloud data security provider, Sentra, has announced that LLMs are now built directly into its data security platform and classification engine to help enterprise customers reduce the data attack surface.
“When properly leveraged, LLM has great potential to better classify unstructured data (such as paragraphs of text, and someday even images) than traditional pattern-matching techniques,” said Ken Buckler, research analyst at Enterprise Management Associates Inc.
Sentra’s data classification engine has traditionally used regular expressions, list classifiers, and validation functions, according to Ron Reiter, co-founder, and chief technology officer of Sentra.
LLM adds context
The use of LLMs has added additional contexts to the process of classification, effecting an efficient tool for the classification of unstructured enterprise data, Sentra said.
“There are two additional contexts which the product now supports while classifying customer data — full (document level) classification and better entity recognition,” Reiter said. “Document-level classification enables Sentra to decide on the high-level type of document. For example, whether the document is a legal contract, a payslip, or a technical documentation.”