Analytical RoadMap Cleaning
Which segments of the Nebraska mortgage market to focus on?
2
Nebraska mortgage that Fannie Mae acquired between 2000-2021
Data visualization, Classification models, identify important features related to profit and defaulting loan
Target best segment through dominating variables
Successfully expand the market of Nebraska while fitting the 2 objectives
Business Outcome
Business Action
Data
Data analytics
Data Analysis (Who is the Competitors?)
Profit/Defaulting Loans & Servicer
Finding:
Recommendation:
3
Data Analysis (What is a good consideration factor?)
Profit/Defaulting Loans & Credit Score Group
Finding:
Recommendation:
4
Data Analysis (Where is the good region?)
Profit & Region Segmentation
The region group distinguishes where the property securing the mortgage loan.
Finding:
Recommendation:
5
Nebraska
Data Analysis (When is current trend going towards?)
Profit/Defaulting Loans & Year Group
Finding:
This is the slide footer, which can be edited under Header & Footer, along with slide numbers and date
6
5/28/18
Data Analysis (Why is currently a good investment timing?)
Original Interest Rate & Year
Finding:
7
Model
Percentage of defaulting loan = (Number of Delinquent + Number of Deed-in-Lieu) / Number of total loans
Random Forest
F30_UPB : UPB at the Time of Removal (30 Days)
ORIG_VAL : The original value of the property.
LAST_RT : Current Interest Rate
CSCORE_B : Borrower Credit Score at Origination
DTI : Debt-To-Income
OCLTV : Original Combined Loan to Value Ratio
Findings
Output before covid
According to the data that is collected before the covid time, there are three segments in the market that this research will suggest the bank to focus on:
9
C-Score
Original
value of property
High risk high return
Findings
Output before covid
The second suggested segment of the market:
10
C-Score
Interest Rate
Medium risk, medium return
Findings
Output before COVID
The third suggested segment of the market:
11
Interest Rate
Original
value of property
medium risk, high return
Findings
Output during COVID
We believe that the effect of covid will last for years, which means the findings after 2020 March will have certain reference value for future predictions.
There are some differences between the period during covid and before covid, it is clear that set a lower interest rate to people with medium C score will be better, and the similarity is that original value should always be in the highest region.
12
C-Score
Interest rate
Highest profit with low risk
Summary
Key takeaways from data analytics & modelling
13
Summary
Recommendation
Future consideration
14
Thank you!
Any Questions?
Appendixata Cleaning
Content
This is the slide footer, which can be edited under Header & Footer, along with slide numbers and date
16
5/28/18
Tableau Interactive Dashboard: https://prod-ca-a.online.tableau.com/#/site/xueyijia/workbooks/5938?:origin=card_share_link
Code and Datasets: https://drive.google.com/drive/folders/11AGIiUTq9z6hMLML3TDtRP0HGbv-O3LI?usp=sharing
Appendixata Cleaning
Data Cleaning
This is the slide footer, which can be edited under Header & Footer, along with slide numbers and date
17
5/28/18
Approach:
F30_UPB = UPB at the Time of Removal (30 Days)
Profit = Original UPB * (Original Rate / 100) * (Original Loan Term / 12) - F30_UPB
Percentage of defaulting loan
During COVID
Percentage of profit
During COVID