Article on eDiscovery Technologies Pending Law Journal Publication

The Suffolk Journal of Trial and Appellate Advocacy law journal will publish Peeking Inside the Black Box: A Preliminary Survey of Technology Assisted Review (TAR) and Predictive Coding Algorithms for eDiscovery. Shannon Brown is excited about the law journal publication. He taught eDiscovery technologies at Widener School of Law in 2015. The teaching provided insights into the educational needs of the legal community related to the complex issues associated with eDiscovery technologies. Shannon Brown is also the author of open source eDiscovery software (Prolorem eDi) used for law school classes and by legal community members.

The abstract for the pending article summarizes:

This article fills a troubling gap in the legal literature related to e-Discovery software systems. Lawyers, law students, and law school professors have no concise resource for learning about or teaching about e-Discovery technologies such as technology assisted review (TAR), “predictive coding,” and older keyword search systems.

Peeking Inside the Black Box provides the legal community with a preliminary overview of some of the algorithms and methods used in keyword search, TAR, and “predictive coding” software. The article first illustrates the ethical duties and strategic or practical reasons for knowing how these technologies work. The objective is to reduce reliance on non-lawyer experts—who may misunderstand the legal implications of applying technical systems.

Before delving into the algorithms, the article then addresses how these computer algorithms translate human-readable documents into computer-understandable “language”—called preprocessing. Surprisingly, preprocessing has not been addressed in legal literature even though this step defines what the algorithms “see” and thus the potential effectiveness of the algorithm output.

The article then explains the critical distinction between keyword search systems and TAR or predictive coding systems. This distinction, hinted at in case law and articles, finally reveals the source of the Go Fish Problem—where lawyers blindly select keywords in hope of identifying relevant materials. However, the explanation requires a basic technical understanding of how keyword search algorithms fundamentally differ from TAR or predictive coding algorithms. Once understood, lawyers gain additional insights into when and how to deploy these tools in litigation.