1 of 35

Systematic Content Analysis of Litigation EventS

S

C

A

L

BLOCKCHAIN

2 of 35

🤷‍♀️ �Insight from millions of court records is functionally inaccessible.

The problem:

$.10/page + nonuniform local practices + no meaningful bulk access

free, but limited in scope or coverage

costly + coverage often unclear + no bulk access

x hundreds of state and local courts

BLOCKCHAIN

3 of 35

SCALES aims to bring transparency

to the systems and processes

of the U.S. courts,

ensuring that they are

fair, efficient, and accurate

BLOCKCHAIN

4 of 35

SCALES team

SOCIAL

SCIENCE

ENGINEERING

JOURNALISM

LAW

BLOCKCHAIN

5 of 35

Interdisciplinary Fruits

© SCALES OKN

BLOCKCHAIN

6 of 35

Mission

  • Focus on what impacts new research initiatives could have in the justice system
  • Provide a space for interdisciplinary collaborations to form
  • Grow data-driven practices of assessing the justice system

© SCALES OKN

BLOCKCHAIN

7 of 35

What SCALES is

�An “open” knowledge network (OKN) would be available to all stakeholders, including the researchers who will help push this technology further. An OKN requires a nonproprietary, public–private development effort that spans the entire data science community and results in an open, shared infrastructure.

© SCALES OKN

BLOCKCHAIN

8 of 35

© SCALES OKN

BLOCKCHAIN

9 of 35

What SCALES does

Platform enables systematic analysis of court records

© SCALES OKN

BLOCKCHAIN

10 of 35

What SCALES has

  • Docket sheets for all criminal and civil cases filed in 2016 and 2017 in all 94 U.S. district courts
  • Docket sheets for all criminal and civil cases filed in 2002 – March 2021 in the Northern District of Illinois
    • Downloaded as of Fall 2020

  • All documents filed in N.D. Ill. in 2016

~750,000+ civil and criminal docket sheets

Court docs from ~12,800+ cases

© SCALES OKN

BLOCKCHAIN

11 of 35

SCALES tools

https://github.com/scales-okn

Software automatically runs PACER in a browser to download cases

Can scrape:

  • Queries
  • Dockets
  • Case summaries
  • Documents

© SCALES OKN

BLOCKCHAIN

12 of 35

Data extraction and transformation

HTML

JSON

Extracts and keys:

  • Case header information (nature of suit, judge, filing dates)
  • Parties (including address) and charges
  • Lawyers (address, phone, pro hac status)
  • Docket entries and documents if requested

© SCALES OKN

BLOCKCHAIN

13 of 35

Understand and enrich data

© SCALES OKN

BLOCKCHAIN

14 of 35

Understand and enrich data

Entity Recognition & Disambiguation

Relation to case

Entity Type

© SCALES OKN

BLOCKCHAIN

15 of 35

Judge recognition and disambiguation

IFP decisions: case study

Other settings,

e.g. bond decisions?

© SCALES OKN

BLOCKCHAIN

16 of 35

Named Entity Recognition (NER)

Amy Joan St. Eve

Amy St. Eve

A.J. St. Eve

St. Eve

Honorable

District Judge

Judge

D.J.

  • 159,000 name variants
  • 40 honorific variants
  • 10 million+ string combinations

Rules-Based Pruning

  1. Order signed by District Judge Doe
  2. Preliminary hearing held in chambers of Doe, District Judge. All parties advised...
  3. Plaintiff Doe’s motion for extension of time.

© SCALES OKN

BLOCKCHAIN

17 of 35

Custom NER pipeline

Precision compared to spaCy’s out-of-the-box NER model for 2017 dockets

PRESIDE

98.6%

spaCy en_core_web_lg

85.9%

using spaCy’s ‘PERSON’ tag

spaCy v.3 trained from scratch

© SCALES OKN

BLOCKCHAIN

18 of 35

Lawyer and Law Firm Disambiguation

© SCALES OKN

BLOCKCHAIN

19 of 35

Corporate Party Disambiguation

© SCALES OKN

BLOCKCHAIN

20 of 35

Visualizing litigation (civil example*)

Complaint

Defendant’s MTD granted in full

Settlement

Discovery

Defendant’s MSJ

Defendant’s MSJ granted in full

Settlement: Consent Decree

Trial verdict: Plaintiff

Default judgment

Defendant’s MTD denied in full

Settlement

Defendant’s MTD granted in part

Settlement of remaining claims

*criminal in progress.

Defendant’s MTD

© SCALES OKN

BLOCKCHAIN

21 of 35

Litigation event ontology

Goal: Enable users to answer the following types of questions*:

  • What events happen in litigation?
  • How do cases and claims conclude?
  • What is the pathway by which cases travel from beginning to end?

Plus layer in court, judge, party, lawyer, and claim attributes.

*Do better than the Federal Judicial Center’s Integrated Database

© SCALES OKN

BLOCKCHAIN

22 of 35

MVP litigation events

Case beginning

  • Complaint
  • Notice of Removal
  • Writ of Habeas Corpus
  • Inbound transfer
  • Other

Answer

Discovery beginning

  • Scheduling conference?
  • Initial disclosures?
  • Work backward from MSJ?

Trial beginning

How to choose which events to include?

  • Balance comprehensive classification of each docket entry (“boil the ocean”) with MVP achievability.
  • Beginning 🡪 middle (few events) 🡪 end
  • Build in user search options as backstop.

© SCALES OKN

BLOCKCHAIN

23 of 35

MVP litigation events (cont.)

    • Trial
      • Verdict (jury)
      • Findings of fact and conclusions of law (bench trial)

  • Party resolution
      • Settlement
      • Rule 68
      • Voluntary dismissal
      • Party-provided judgment, e.g. consent decree

  • Granted/partially granted dispositive motions
      • Motion to dismiss
      • Motion for judgment on the pleadings
      • Motion for summary judgment
      • Motion for a judgment as a matter of law
  • Default judgment
  • Terminating sanctions and sua sponte dismissals
  • Outbound transfer or remand

Dispositive events (claim, case, or party)

Notice of Appeal

© SCALES OKN

BLOCKCHAIN

24 of 35

Mapping and modeling challenge

Default judgment

Complaint

1 to 1

x 94 U.S. district courts

Many to 1

https://huggingface.co/scales-okn

© SCALES OKN

BLOCKCHAIN

25 of 35

Satyrn

© SCALES OKN

BLOCKCHAIN

26 of 35

GUI

© SCALES OKN

BLOCKCHAIN

27 of 35

Make analysis simple

© SCALES OKN

BLOCKCHAIN

28 of 35

Available on Github

Published and available under GPL licensing

https://github.com/scales-okn

© SCALES OKN

BLOCKCHAIN

29 of 35

Available on Github

Published and available under GPL licensing

https://github.com/scales-okn

General public license giving users 4 freedoms: to run, to study, to share, and to modify the software

© SCALES OKN

BLOCKCHAIN

30 of 35

Documentation site

Available through SCALES main site: https://scales-okn.org/

© SCALES OKN

BLOCKCHAIN

31 of 35

Case Study

How often do judges grant requests for in forma pauperis?

BLOCKCHAIN

32 of 35

Case Study

BLOCKCHAIN

33 of 35

Case Study: Fee Waiver

Nearly 40 percent of judges approve fee waivers at a rate that significantly differs from their peer judges across

all 94 districts

BLOCKCHAIN

34 of 35

Next Steps

BLOCKCHAIN

35 of 35

Next Steps

SCALES

ACCESS TO JUSTICE WORKSHOP

🡨 Ryan

BLOCKCHAIN