1 of 9

Automated Fact Checking Based on Czech

Wikipedia

Tomáš Mlynář

Bachelor’s thesis presentation

Supervisor: Ing. Herbert Ullrich

2 of 9

Assignment

Combine previous results from AIC to build a showcase application.

2/9

Dataset

3/9

Filtering

4/9

Document Retrieval

5/9

Natural Language Inference

6/9

Prototype Showcase Application

7/9

Conclusion

Thesis Accomplishments:

Explored state-of-the-art methods for NLI and document retrieval.
Preprocessed Wikipedia dump from Hugging Face.
Localized FEVER claims using implemented translator and translation model.
Created new Czech dataset available on Hugging Face.
Applied noise filtering on three optimized thresholds (with limited success).
Finetuned and evaluated NLI models on the new datasets (available on Hugging Face).
Implemented and evaluated Anserini and Hybrid document retrievers.
Integrated NLI and document retrieval models into a pipeline.
Developed a prototype showcase application.

8/9

Thank you for your attention!

9