SYS-CON MEDIA Authors: Liz McMillan, AppDynamics Blog, David Sprott, tru welu, Blue Box Blog
News Feed Item
Catalyst Files Patent for Next-Generation Technology Assisted Review Based on 'Reinforcement Learning'
New Research Validates That Continuous Learning Methods Improve Savings and Results in Technology Assisted Review
|By Marketwired .
|June 11, 2014 02:19 PM EDT
DENVER CO -- (Marketwired) -- 06/11/14 -- Catalyst Repository Systems -- a pioneer in developing secure, cloud-based software to help corporations and their law firms take control of e-discovery, compliance and regulatory matters -- today announced it has applied for a patent on the type of continuous learning capability it invented for its next-generation technology assisted review (TAR 2.0) platform, Insight Predict.
Described in the patent application as "reinforcement learning based document coding," Catalyst's TAR technology is able to continuously learn from actions taken by the review team throughout the review process. With reinforcement learning, certain actions -- such as coding a document as responsive or not or adding additional documents -- enable the system to continue to grow "smarter" in its ability to select relevant documents.
What is Reinforcement Learning?
Reinforcement learning differs from older TAR 1.0 systems which require training by a high-level attorney. This expensive and time-consuming approach requires the senior attorney to first review and code an initial training set of randomly selected documents. With Catalyst's reinforcement learning technology, the full review team can begin right away. As reviewers' judgments are fed back into the system and new documents added, the system's selection and ranking of relevant documents continuously improves.
A new, peer-reviewed study by two leading experts in e-discovery validates the effectiveness of continuous learning technologies in e-discovery. In a paper they will present at the Association of Computing Machinery Special Interest Group on Information Retrieval (SIGIR) international conference in July 2014, "Evaluation of Machine-Learning Protocols for Technology-Assisted Review in Electronic Discovery," Gordon V. Cormack and Maura R. Grossman conclude that non-random training methods using continuous active learning "require substantially and significantly less human review effort" and yield "generally superior results."
Why is Catalyst's Approach Unique?
Even among continuous learning systems, Catalyst's method is unique for its use of reinforcement learning rather than active learning. Active learning systems are geared towards optimizing the quality of the classifier, the algorithm that labels documents as relevant or not. By contrast, reinforcement learning is designed to optimize for the goal the user seeks to achieve, which is generally to find as many relevant documents as possible. In this way, reinforcement learning helps users reach that goal more quickly.
"In contrast to the 'one bite of the apple' approach of earlier TAR engines, Insight Predict is able to use judgmental seeds and relevance feedback to continuously learn and rank throughout the review process, while avoiding the problems of bias and incomplete coverage through its use of contextual diversity," said John Tredennick, Catalyst's founder and CEO. "This is a major benefit to our clients because it eliminates the need for subject-matter experts for training, allows the review to get started sooner, accommodates rolling uploads, and ultimately delivers savings in time and costs."
Catalyst's unique reinforcement learning system was developed by Dr. Jeremy Pickens, Catalyst's senior research scientist, and Bruce Kiefer, Catalyst's vice president, platform. Pickens, one of the world's leading search scientists and a pioneer in the field of collaborative exploratory search, has a number of patents and patents pending in the field of information retrieval.
Overcoming the Five Myths of TAR
Catalyst's technology upends a number of common misconceptions about TAR -- that training is finite based on an initial seed set, that documents for training must be selected at random, that subject matter experts are required to train the system, that training cannot start until all documents on hand, and that it does not work for non-English documents.
To read more about the myths surrounding TAR and how advanced systems disprove them, see John Tredennick's Law Technology News article, Five Myths About Technology Assisted Review.
About Catalyst Repository Systems
A pioneer in cloud-based litigation technology, Catalyst provides global corporations and their counsel with secure, hosted document repositories to manage discovery, regulatory inquiries and other complex legal matters. Clients use Insight, Catalyst's "Big Discovery" platform, and Insight Predict, our advanced technology assisted review engine, to reduce discovery costs and associated risks. Corporations gain greater control and predictability over the discovery process and greater visibility across all their legal matters.
For more information, visit Catalyst at www.catalystsecure.com or follow us on Twitter at: http://twitter.com/catalystsecure.
The most often asked question post-DevOps introduction is: “How do I get started?” There’s plenty of information on why DevOps is valid and important, but many managers still struggle with simple basics for how to initiate a DevOps program in their business. They struggle with issues related to current organizational inertia, the lack of experience on Continuous Integration/Delivery, understanding where DevOps will affect revenue and budget, etc.
In their session at DevOps Summit, JP Morgentha...
May. 30, 2015 02:30 PM EDT Reads: 719
In a recent research, analyst firm IDC found that the average cost of a critical application failure is $500,000 to $1 million per hour and the average total cost of unplanned application downtime is $1.25 billion to $2.5 billion per year for Fortune 1000 companies. In addition to the findings on the cost of the downtime, the research also highlighted best practices for development, testing, application support, infrastructure, and operations teams.
May. 30, 2015 02:15 PM EDT Reads: 1,316
Software is eating the world. Companies that were not previously in the technology space now find themselves competing with Google and Amazon on speed of innovation. As the innovation cycle accelerates, companies must embrace rapid and constant change to both applications and their infrastructure, and find a way to deliver speed and agility of development without sacrificing reliability or efficiency of operations.
In her Day 2 Keynote DevOps Summit, Victoria Livschitz, CEO of Qubell, discussed...
May. 30, 2015 02:00 PM EDT Reads: 5,570
The speed of product development has increased massively in the past 10 years. At the same time our formal secure development and SDL methodologies have fallen behind. This forces product developers to choose between rapid release times and security.
In his session at DevOps Summit, Michael Murray, Director of Cyber Security Consulting and Assessment at GE Healthcare, examined the problems and presented some solutions for moving security into the DevOps lifecycle to ensure that we get fast AND ...
May. 30, 2015 02:00 PM EDT Reads: 4,932
SYS-CON Events announced today that MetraTech, now part of Ericsson, has been named “Silver Sponsor” of SYS-CON's 16th International Cloud Expo®, which will take place on June 9–11, 2015, at the Javits Center in New York, NY.
Ericsson is the driving force behind the Networked Society- a world leader in communications infrastructure, software and services. Some 40% of the world’s mobile traffic runs through networks Ericsson has supplied, serving more than 2.5 billion subscribers.
May. 30, 2015 02:00 PM EDT Reads: 1,527
The OpenStack cloud operating system includes Trove, a database abstraction layer. Rather than applications connecting directly to a specific type of database, they connect to Trove, which in turn connects to one or more specific databases. One target database is Postgres Plus Cloud Database, which includes its own RESTful API. Trove was originally developed around MySQL, whose interfaces are significantly less complicated than those of the Postgres cloud database.
In his session at 16th Cloud...
May. 30, 2015 02:00 PM EDT Reads: 1,269
How does one bridge the gap between traditional enterprise storage infrastructures and the private, hybrid, and public cloud?
In his session at 15th Cloud Expo, Dan Pollack, Chief Architect of Storage Operations at AOL Inc., examed the workload differences and required changes to reuse existing knowledge and components when building and using a cloud infrastructure. He also looked into the operational considerations, tool requirements, and behavioral changes required for private cloud storage s...
May. 30, 2015 02:00 PM EDT Reads: 2,793
Working with Big Data is challenging, especially when decision makers depend on market insights and intelligence from your data but don't have quick access to it or find it unusable.
In their session at 6th Big Data Expo, Ian Khan, Global Strategic Positioning & Brand Manager at Solgenia; Zel Bianco, President, CEO and Co-Founder of Interactive Edge of Solgenia; and Ermanno Bonifazi, CEO & Founder at Solgenia, discussed how a revolutionary cloud-based BI along with mobile analytics is already c...
May. 30, 2015 01:00 PM EDT Reads: 5,391
In their general session at 16th Cloud Expo, Michael Piccininni, Global Account Manager – Cloud SP at EMC Corporation, and Mike Dietze, Regional Director at Windstream Hosted Solutions, will review next generation cloud services, including the Windstream-EMC Tier Storage solutions, and discuss how to increase efficiencies, improve service delivery and enhance corporate cloud solution development.
Michael Piccininni is Global Account Manager – Cloud SP at EMC Corporation. He has b...
May. 30, 2015 01:00 PM EDT Reads: 1,284
Explosive growth in connected devices. Enormous amounts of data for collection and analysis. Critical use of data for split-second decision making and actionable information. All three are factors in making the Internet of Things a reality. Yet, any one factor would have an IT organization pondering its infrastructure strategy.
How should your organization enhance its IT framework to enable an Internet of Things implementation? In this session, James Kirkland, Red Hat's Chief Architect for the ...
May. 30, 2015 01:00 PM EDT Reads: 688
While there are hundreds of public and private cloud hosting providers to choose from, not all clouds are created equal. If you’re seeking to host enterprise-level mission-critical applications, where Cloud Security is a primary concern, WHOA.com is setting new standards for cloud hosting, and has established itself as a major contender in the marketplace. We are constantly seeking ways to innovate and leverage state-of-the-art technologies.
In his session at 16th Cloud Expo, Mike Rivera, Seni...
May. 30, 2015 12:45 PM EDT Reads: 1,000
Hardware will never be more valuable than on the day it hits your loading dock. Each day new servers are not deployed to production the business is losing money. While Moore's Law is typically cited to explain the exponential density growth of chips, a critical consequence of this is rapid depreciation of servers. The hardware for clustered systems (e.g., Hadoop, OpenStack) tends to be significant capital expenses.
In his session at Big Data Expo, Mason Katz, CTO and co-founder of StackIQ, disc...
May. 30, 2015 12:30 PM EDT Reads: 5,298
There is no question that the cloud is where businesses want to host data. Until recently hypervisor virtualization was the most widely used method in cloud computing. Recently virtual containers have been gaining in popularity, and for good reason. In the debate between virtual machines and containers, the latter have been seen as the new kid on the block – and like other emerging technology have had some initial shortcomings. However, the container space has evolved drastically since coming on...
May. 30, 2015 12:15 PM EDT Reads: 1,451
The 4th International Internet of @ThingsExpo, co-located with the 17th International Cloud Expo - to be held November 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA - announces that its Call for Papers is open.
The Internet of Things (IoT) is the biggest idea since the creation of the Worldwide Web more than 20 years ago.
May. 30, 2015 12:00 PM EDT Reads: 1,807
T-Mobile has been transforming the wireless industry with its “Uncarrier” initiatives. Today as T-Mobile’s IT organization works to transform itself in a like manner, technical foundations built over the last couple of years are now key to their drive for more Agile delivery practices.
In his session at DevOps Summit, Martin Krienke, Sr Development Manager at T-Mobile, will discuss where they started their Continuous Delivery journey, where they are today, and where they are going in an effort ...
May. 30, 2015 12:00 PM EDT Reads: 1,507