Computational Linguistics I
Computers have made it possible, even easy, to collect vast amounts of text from a wide variety of sources. It is not always clear, however, how to use those data and how to extract useful information from data. This problem is faced in a tremendous range of scholarly, government, business, medical, and scientific applications. The purpose of this course is to teach some of the best and most general approaches to get the most out of natural language.
Finally, it is required that you have regular access to a computer and an Internet connection throughout this course. A laptop is preferable. If you have a laptop, it would be useful to bring the laptop to class, especially for the lab sessions (see below).
This is a flipped course. Lectures are delivered through the Internet, and the traditional “class time” is used for hands on projects, discussion, and working on homework. To make sure that you are watching the videos, there will be a quiz at the start of every class on the material covered in the videos.
Halfway through the course, students will complete an in-class midterm that will test high-level understanding of concepts.
In addition, students will work on a project that emphasizes the concepts covered in the class. The project will be in a group focusing on a shared problem. This project will have three stages:
2.2 Required Background
Mathematical maturity: We will work extensively with probability and mathematical functions such as logarithms and differentiation. You should be comfortable manipulating these concepts algebraically. You should also be able to argue why mathematical statements are true (even if it’s not a 100% formal proof).
We will make extensive use of the Python programming language. It is assumed that you know or will quickly learn how the program in Python. Apart from a quick introduction, there will be no introduction to this skill-set.
The computer-based aspects of this course will be oriented toward Unix-like operating systems (Linux, OS X). It may be possible to complete the course using other operating systems, but you will be responsible for troubleshooting any issues you encounter.
Components of the final grade are as follows:
It is possible to earn extra credit by going above and beyond the expectations of the assignment or by attending relevant extra-curricular events.
Letter grades will be assigned as follows:
I reserve the right to curve up the threshold (i.e. a lower point value may result in a higher grade), but I will not curve down (i.e., a higher point value will not result in a lower grade). The thresholds will be placed uniformly for the entire class.
Please note that if the final grade tabulation comes out to be 79.98, then that corresponds to a C+; I have been exact in the above specifications deliberately. I’m sorry, but if I negotiate on any of these cutoffs, I then need to negotiate on the next one (e.g. if I rounded 79.95 up, then I would get harrassed about 79.94). Especially for large classes, this results in chaos. I need to draw the line somewhere, and I’m sorry if fall on the wrong side of the line (I’ve had that happen to me before too).
There will be a number homework assignments (with different numbers of points). Together, they are worth 30% of your final grade. Assignments are designed to help you learn the material, so please use them for that! You are allowed to collaborate with others (as many people as you'd like), but you must turn in your own assignment. For example, you could work together in a group, but each person must write up their solutions individually. Everything you submit must be your own. Directly from your fingers. Copying and pasting from another student will be considered plagiarism and will be handled according to the university's academic integrity policies.
Assignments are due at 11:59 Eastern on the Friday indicated on the syllabus. Late policy: each person has seven free late days to be used, no questions asked, during the course. Late assignments beyond the provided late grades will not be graded. The grade of the lowest homework assignment will be thrown out.
There will be an in-class midterm and final. The midterm will cover material in the previous lectures and you will be allowed to use one page of handwritten notes and the course text. Likewise, the final will be cumulative but biased toward end of course.
More information will be posted on a separate page for the final project.
Each class is critical to your learning experience, and I expect you to come to class prepared (having read all assigned readings, ready to engage). I also expect active participation, not passive reception of the material. Your energy in contributing to class discussions and hands-on exercises will make this class an enjoyable experience for all of us. Class participation contributes to your total grade (see above).
We will also be using the online learning platform Piazza. You can get credit for participation by answering and asking useful questions on that platform. Ideally you should be participating both online and in class, however.
The University of Maryland, College Park has a nationally recognized Code of Academic Integrity, administered by the Student Honor Council. This Code sets standards for academic integrity at Maryland for all undergraduate and graduate students. As a student you are responsible for upholding these standards for this course. It is very important for you to be aware of the consequences of cheating, fabrication, facilitation, and plagiarism. For more information on the Code of Academic Integrity or the Student Honor Council, please visit http://www.shc.umd.edu.
5. Course Policies
The University has a legal obligation to provide appropriate accommodations for students with disabilities. Please inform the professor of any accommodations needed relative to disabilities at the start of the semester.
Also, University of Maryland policy states that students should not be penalized due to observances of their religious beliefs. Please inform the professor of such instances at the start of the semester so that appropriate steps can be taken.