3 of 163

Table of contents

Networking

(4) Low-layer stuff, ARP, DHCP
(20) IP, UDP, ports, TCP
(32) TLS
(46) DNS
(57) DNSSEC (it’s just a link to Dave’s great slides)
(63) DoS, Firewall, Detection

Crypto

(78) IND-CPA, block cipher designs
(88) Diffie-Hellman, El Gamal
(93) Hashing/pRNG

(98) HTML, JS, SOP
(111) Cookies
(139) SQL injection
(158) XSS, CSRF

5 of 163

Layering

The internet is like an onion: layers!
Each layer relies on services provided by next layer below… and provides services to layer above it
Similar to levels of abstraction when you program

6 of 163

Layering

Layer 1: Physical (bits on a wire, radio waves for WiFi, etc.)
Layer 2: Link -- communication between "local" hosts
Layer 3: InterLink -- communication between different "remote" hosts (IP)
Layer 4: Transport -- TCP / UDP (more details soon)
Layers 5/6: nobody cares.
Layer 7: Application: things like HTTP (websites), SSH, etc.

7 of 163

OSI Layer 1: Physical layer

Analogy:

Something that moves data.
Carrier pigeons. Pizza delivery guy.
How to attack: shoot down the pigeon.

8 of 163

OSI Layer 1: Physical layer

In reality:

Bits moving over space.
Voltage signals, electrical signals, etc.
How to attack: cut the wire.

9 of 163

OSI Layer 2: Link layer

Analogy:

Two post offices connected by a carrier pigeon/delivery guy.
Local: you can only communicate between the post offices.

10 of 163

OSI Layer 2: Link layer

In reality:

Defines communication between local hosts (think Wi-Fi network)
Support for broadcast protocols
Main protocol we care about here: ARP

11 of 163

MAC addresses

Analogy:

Like an apartment number.
Useful for identifying somewhere in the local area, but useless for identifying somewhere in the world.

12 of 163

Attacker Definitions

(for the purposes of this class)

Man-in-the-middle (aka in-path): Attacker can see and modify traffic.
On-path: Attacker can see but not modify traffic.
Off-path: Attacker can’t see anything. (Still knows public information)

Remember all attackers can send whatever packets they want!

Think about what fields an attacker would need to guess correctly!

13 of 163

ARP: Address Resolution Protocol

Problem: Translate a global (IP) address to a local (MAC) address.
Analogy:

Mysterious package arrives addressed to someone in the complex.
We are going to have trust someone if they claim it’s theirs.

14 of 163

ARP: Address Resolution Protocol

Converts IP addresses to MAC addresses.

Host A wants to send a message to Host B
Host A gets Host B’s IP address (Call it X) from DNS
Host A broadcasts “What is the MAC address of X?”
Host B sends a message only to A “My IP is X and my MAC address is _”.
The IP, MAC pair gets cached

(If the X is outside the local network, then the gateway responds with their MAC address)

15 of 163

ARP: Spoofing

The ARP request is public. What happens if

Alice sends out an ARP request for Bob’s MAC address
Mallory responds with hers instead and beats Bob’s response.

16 of 163

ARP: Spoofing

Alice now believes Mallory’s MAC address is Bob’s and will send her packets intended for Bob.

17 of 163

DHCP: Dynamic Host Configuration Protocol

Analogy:

When you first move into your new apartment, you need to learn your new address and figure out where the laundromat is.
You don’t know anyone here, so you’re going to trust any neighbor’s advice.

18 of 163

DHCP: Dynamic Host Configuration Protocol

When you first connect to a network, you need a few things:

An IP address
The DNS server’s IP address
The gateway’s (router's) IP address

19 of 163

DHCP: Handshake

Client Discover: “I need configuration information!”

Server Offer: “Here’s some possible configurations: A, B, C”

Client Request: “I want to use configuration C”

Server Acknowledge: “Okay, I have given you config C”

Client messages are broadcast across the local network

20 of 163

DHCP: Spoofing

What happens when Alice accepts Mallory’s configuration?

Mallory now controls Alice’s:

Gateway
DNS (we talk about this later)

What can she do with this?

21 of 163

OSI Layer 3: (Inter)network Layer

Analogy:

Make lots of hops from post office to post office. Now you can send messages anywhere in the world!
Apartment numbers (MAC addresses) now useless. Instead we need global addresses (IP addresses)

22 of 163

OSI Layer 3: (Inter)network Layer

Global routing
No guarantee of reliability
Packet delivery between different local networks, “best effort”

23 of 163

Headers: IP

0	Version	IHL	DSCP	ECN	Total Length
32	Identification				Flags	Fragmentation Offset
64	Time to Live		Protocol		Header Checksum
96	Source IP Address
128	Destination IP Address
160	Payload: arbitrary data

24 of 163

OSI Layer 4: Transport Layer

Analogy:

Now that we can send messages anywhere in the world, we can start conversations.
Pen pals, correspondence chess, etc.

25 of 163

OSI Layer 4: Transport Layer

Connection: one service (on a host) to another
TCP: Reliable, in-order, connection-oriented transport
UDP: Unreliable, datagram-based transport

A datagram is a single packet message

26 of 163

Headers: UDP

0	16-bit source port	16-bit destination port
32	16-bit length field	16-bit checksum
64	Payload: arbitrary data

27 of 163

Quick detour: Ports

Analogy:

Alice is pen pals with Carol. Alice’s roommate Bob is also pen pals with Carol.
Carol’s replies are addressed to the same global (IP) address. How to tell which letters are for Bob and which are for Alice?

28 of 163

Quick detour: Ports

Analogy:

Solution: Add a room number (port) inside the letter.
In private homes like Alice/Bob, the port numbers are meaningless.
In a public office (server) like Cory Hall, the port numbers are constant and known.

29 of 163

TCP: Transmission Control Protocol

Three-way handshake:

Client sends: SYN, SEQ= x (arbitrary by the client)
Server sends: SYN-ACK, SEQ= y (arbitrary by the server), ACK = x + 1
Client sends: ACK, ACK = y + 1

The SEQ and ACK values swap depending on who’s sending the message. (You ACK the SEQ you receive.)

30 of 163

Headers: TCP

0	16-bit source port			16-bit destination port
32	Sequence number
64	Acknowledgment number (if ACK set)
96	Data Offset	Reserved	Flags	Window size
128	Checksum			Urgent pointer
160	Payload: arbitrary data

32 of 163

TCP: Vulnerabilities

TCP is not inherently secure

No confidentiality or integrity of data by default

A malicious entity who knows the sequence numbers, port numbers, and IP addresses can spoof a connection

This means on-path attackers can easily inject data.
If an off-path attacker can guess this information, they can also inject data

It also does not provide availability

TCP RST injection lets an attacker terminate connections

33 of 163

Layer 6.5: Secure communication

Analogy:

Protect against correspondence chess pirates.
(Not a real layer, because OSI is ancient and predates the invention of security.)

34 of 163

Layer 6.5: Secure communication

TCP is not inherently secure

No confidentiality or integrity of data by default

A malicious entity who knows the sequence numbers, port numbers, and IP addresses can spoof a connection

This means on-path attackers can easily inject data.
If an off-path attacker can guess this information, they can also inject data

It also does not provide availability

TCP RST injection lets an attacker terminate connections

35 of 163

TLS

Goal is to create a secure channel

Confidentiality: nobody can see the data communicated
Integrity: you know the data has not been changed in-transit
Timeliness: you are guaranteed that old data is not being replayed by a MITM

DOES NOT PROVIDE:

Anonymity: easy to figure out who is talking with whom
Availability: TLS does not prevent DoS or messages being adversarially dropped
Padding^*: one can tell how much data is being communicated
Non-repudiability: not possible to prove that this was the message you received

36 of 163

HTTPS Connection (SSL / TLS)

Browser (client) connects via TCP to Amazon’s HTTPS server
Client picks 256-bit random number R_B, sends over list of crypto protocols it supports
Server picks 256-bit random number R_S, selects protocols to use for this session
Server sends over its certificate
Client now validates cert

Browser

Amazon

Server

Hello. My rnd # = R_B. I support

(TLS+RSA+AES128+SHA1) or

(SSL+RSA+3DES+MD5) or …

My rnd # = R_S. Let’s use

TLS+RSA+AES128+SHA1

Here’s my cert

~2-3 KB of data

37 of 163

PS via RSA

Client generates premaster secret (PS), encrypts it with server's public key, and sends it to server

"If you really have the RSA private key, prove it to me by decrypting this PS."

Both sides use PS (along with R_b, R_s) to derive symmetric keys.

Browser

Here’s my cert

~2-3 KB of data

{PS}_PKAmazon

Amazon

Server

Q: Forward secrecy?

A: No forward secrecy because attacker can decrypt PS and knows R_B, and R_S and computes secrets

38 of 163

PS via Diffie-Hellman

For Diffie-Hellman, server generates random a, sends public params and g^a mod p

Signed with server’s private key

Browser verifies signature using PK from certificate

(provides security against classic Diffie-Hellman MITM)

Browser generates random b, computes PS = g^ab mod p, sends g^b mod p to server
Server also computes PS = g^ab mod p
Both sides use PS to derive symmetric keys.

Browser

Here’s my cert

~2-3 KB of data

g^b mod p

{g, p, g^a mod p} _SKAmazon

Amazon

Server

Q: Forward secrecy?

A: Has forward secrecy because shared secret never sent over the network! If attacker as SK_Amazon, cannot find a.

39 of 163

Symmetric Key Generation

Both sides now have PS.
Using PS, both sides derive four keys:

C_b - encryption key for messages from browser to the server
C_s - encryption key for messages from server to the browser
I_b, I_s - integrity keys (directional as above)

Both sides now compute a MAC on the dialog (all messages from both sides).

If a MITM tampered with the earlier steps (say, R_b, R_s, cipher suite agreement, &c), then they will be detected now.

Browser

Amazon

Server

{M₁, MAC(M₁,I_B)}_CB

MAC(dialog,I_S)

MAC(dialog,I_B)

…

41 of 163

Nice property of TLS: End-to-end security

Properly implemented TLS is end-to-end secure.
Literally anybody between you and your pal can MITM attack you, and your communication would still be secure.

42 of 163

TLS Limitations/Issues

The system requires us to trust ALL Certificate Authorities
Certificate management is complicated
TLS can’t protect against logical errors on hosts.

43 of 163

OSI Layer 7: Application Layer (aka L5)

Application-to-application connection
DHCP, HTTP, DNS, SSH, TLS, etc.
Welcome to web security...

44 of 163

If you receive an HTTP Packet it might look like this

IEEE 802.2 Headers
IP Headers
TCP Headers
HTTP Headers
HTTP Body

What is the TCP payload?

What is the IP payload?

45 of 163

If you receive an HTTP Packet it might look like this

IEEE 802.2 Headers
IP Headers
TCP Headers
HTTP Headers
HTTP Body

What is the TCP payload?

What is the IP payload?

46 of 163

If you receive an HTTP Packet it might look like this

IEEE 802.2 Headers
IP Headers
TCP Headers
HTTP Headers
HTTP Body

What is the TCP payload?

What is the IP payload?

48 of 163

DNS: Domain Name System

Protocol for translating human-friendly domain names to a numerical address: www.berkeley.edu -> 52.10.99.247

Nameservers at least responsible for pointing to you in the right direction: .edu’s name server might not know how to get to www.berkeley.edu, but it knows that berkeley.edu’s name server knows and that name server’s IP address.

49 of 163

DNS: Example (borrowed from cs168)

DNS client�(me.cs.berkeley.edu)

DNS server

.org NS

wikipedia.org servers

Where is www.wikipedia.org?

Go ask .org nameserver at 1.2.3.4

Go ask wikipedia.org server at 5.6.7.8

Answer record: 9.10.11.12

ROOT

50 of 163

DNS: Resource Record (RR) Types

Each RR has a type and TTL (How long the record should be cached)

A or AAAA: Matches a name w/ an IPv4 or IPv6 address

berkeley.edu 600 A 35.163.72.93

NS: Gives you a name server responsible for a domain

edu. 172800 NS a.edu-servers.net

51 of 163

A typical DNS response: “Hey I found what you’re asking for”

;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 39459

;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 2, ADDITIONAL: 3

;; OPT PSEUDOSECTION:

; EDNS: version: 0, flags:; udp: 4096

;; QUESTION SECTION: (What domain was queried for)

;berkeley.edu. IN A

;; ANSWER SECTION: (What the address of the queried domain is)

berkeley.edu. 16 IN A 35.163.72.93

;; AUTHORITY SECTION: (What the name servers responsible for berkeley.edu are)

berkeley.edu. 43860 IN NS adns2.berkeley.edu.

berkeley.edu. 43860 IN NS adns1.berkeley.edu.

;; ADDITIONAL SECTION: (Possibly helpful additional information: Where the nameservers are)

adns1.berkeley.edu. 90569 IN AAAA 2607:f140:ffff:fffe::3

adns2.berkeley.edu. 693 IN A 128.32.136.14

adns2.berkeley.edu. 1598 IN AAAA 2607:f140:ffff:fffe::e

52 of 163

Another typical DNS response: “I don’t know, but I can tell you who does and where they are”

;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 53577

;; flags: qr rd ra; QUERY: 1, ANSWER: 0, AUTHORITY: 2, ADDITIONAL: 2

;; OPT PSEUDOSECTION:

; EDNS: version: 0, flags:; udp: 4096

;; QUESTION SECTION: (What domain was queried for)

;berkeley.edu. IN A

;; ANSWER SECTION: (What the address of the queried domain is. Note the lack of value)

;; AUTHORITY SECTION: (What the name servers responsible for .edu are)

edu. 172800 IN NS a.edu-servers.net.

edu. 172800 IN NS b.edu-servers.net.

;; ADDITIONAL SECTION: (Possibly helpful additional information: Where the nameservers are)

a.edu-servers.net. 172800 IN A 192.5.6.30

b.edu-servers.net. 172800 IN A 192.33.14.30

53 of 163

Headers: DNS

0	Identification
16	QR	Opcode	AA	TC	RD	RA	(Just zeroes)	Response Code
32	QDCOUNT: Number of resource records in question section
48	ANCOUNT: Number of resource records in answer section
64	NSCOUNT: Number of resource records in authority section
80	ARCOUNT: Number of resource records in authority section
96	Arbitrary Data

54 of 163

DNS: Vulnerabilities

Malicious DNS server

.edu name server telling us berkeley.edu’s name server is ns1.stanford.edu’s IP address

Why doesn’t bailiwick protect against this?

On-path eavesdropper

Spoofing

Off-path attacker

Blind-spoofing

What do spoofers need to guess correctly to be accepted by the victim?

55 of 163

DNS: Off-path spoofing

Eve

DNS resolver

wikipedia.org NS

Where is www.wikipedia.org?

www.wikipedia.org 1200 A 198.35.26.96

www.wikipedia.org 1200 A 6.6.6.6

Eve can’t see the source UDP port or the DNS ID field in this scenario.

56 of 163

DNS: Kaminsky Attack

Once a resolver receives a response, it will cache the results. So not only does the spoofer need to beat the real result, once it loses, it can’t try again for a bit.
Kaminsky’s attack subverts this by trying to poison the name server’s IP address in the Additional section.

57 of 163

DNS: Kaminsky attack

Eve

DNS resolver

wikipedia.org NS

Where is aaaaaaa.wikipedia.org?

aaaaaaa.wikipedia.org does not exist

wikipedia.org 1200 NS ns1.wikipedia.org

ns1.wikipedia.org A 6.6.6.6

If this attack succeeds, then any future query for any domain in wikipedia.org will be sent to Eve.

If it doesn’t, it doesn’t matter. The cache has not been altered. Just ask on another random subdomain.

59 of 163

Why DNSSEC?

DNS provides poor security, so DNSSEC was introduced as an alternative, guaranteeing these properties:

Integrity (not confidentiality) via signatures
Cache-ability of the signatures
Extends and remains backwards compatible with DNS

60 of 163

DNSSEC

I’m not making DNSSEC slides. Let’s go to https://cs161.org/assets/lectures/lec21_dnssec.pdf

Start at slide 11.

61 of 163

NSEC

Problem: how to assert a domain DOES NOT exist?

We could sign: "Domain X does not exist" using our signing key
But that would require online cryptography ⇒ added latency

NSEC provides offline authenticated denial to solve this

Can do all the crypto "in advance"

62 of 163

NSEC

berkeley.edu

a.berkeley.edu

ab.berkeley.edu

cd.berkeley.edu

Give me an NSEC for bc.berkeley.edu.

(Prove to me that no such domain bc.berkeley.edu exists.)

ab.berkeley.edu. NSEC cd.berkeley.edu.

ab.berkeley.edu. RRSIG NSEC <signature>

63 of 163

Enumerating NSEC

[Perceived] Problem with NSEC: it allows an attacker to discover all subdomains of a site

Keep making requests for invalid domain names, and collect all NSEC records
Faster than trying to guess valid subdomains and making DNS requests for them

NSEC3 tries to solve this by using hashes

Attest that there are no domains whose HASH H falls between H(N) and H(M)
Not very effective against modern GPUs which can make many hash tries per second

64 of 163

DoS, Firewall, and Detection

65 of 163

Denial of Service

Attack on availability
Prevents legitimate users from using a service
DDoS - Distributed Denial of Service is when a server is flooded with traffic from multiple different devices

66 of 163

Denial of Service Techniques

Deny service using program flaw
Deny service using resource exhaustion

67 of 163

Problem - Unequal Playing Field

DoS through network exhaustion - easy to escape detection by using IP spoofing
Amplification techniques make cost for an attacker low compared to the damage they can cause

68 of 163

Denial of Service Techniques

DNS Lookup

The response is typically much larger than the request

SYN flooding

Can spoof IP and cause server to maintain TCP state

69 of 163

Denial of Service Mitigation

Throw money/resources at it

Ex. Cloudflare

For SYN flooding specifically, we can use SYN cookies (see lecture 22)
Isolation and scheduling mechanisms

OS may do this by killing programs that are taking too much memory

70 of 163

Firewalls

Way to protect networks cheaply at scale
For this class, think of firewalls as a set of rules that the network follows (drop or accept packet)

71 of 163

Stateless vs Stateful

Stateless packet filters examine packets and deny or allow them based on the information in the packet alone
Stateful packet filters keep track of the connection context of each packet

72 of 163

Firewalls cont.

Example rule: allow tcp connection 4.5.5.4:* -> 3.1.1.2:80

Allow any packets associated with tcp connections that start from host at 4.5.5.4 with any port and are connecting to 3.1.1.2 at port 80 (HTTP)

Example rule: allow tcp connection *:*/int -> *:*/ext

Allow any packets associated with a tcp connection that originated inside the network

Example rule: allow tcp connection *:*/ext -> 1.2.2.3:80/int

Allow any packets associated with a tcp connection that are directed toward our internal HTTP server at 1.2.2.3.

73 of 163

Firewall Disadvantages

Assumes that insiders are trusted
Some applications don’t work with firewalls
May reduce network’s usefulness (a rule that blocks an intended legitimate connection)

74 of 163

Firewall Advantages

Centralized control of a potentially large network, allowing for added security for many machines
Attack surface reduction (we specify what is allowed)

75 of 163

Detection

NIDS - network intrusion detection system
HIDS - host based intrusion detection system
Logging - log all activity, (maybe) analyze offline

76 of 163

Base Rate Fallacy

Don’t take false positive and false negative rates at face value - we need more information to analyze them

Frequency of attack
Cost of false positive vs. cost of false negative

77 of 163

Detection Strategies

Signature based - memorize specific features of an attack
Behavior based - look for evidence of compromise
Anomaly based - look for unusual activity

Learn normal behavior
Once we have a model of normal behavior, look for deviations from this pattern

Specification based

Specify what is allowed
Eg. decide that all URL parameters sent to foocorp.com servers must have at most one ‘/’ in them

See lecture 25 for more details

78 of 163

End of networking.

Cryptography

79 of 163

Cryptography: Symmetric Keys

Concept

Encrypting and decrypting use the same key

Goals

Confidentiality
Indistinguishability Under Chosen-Plaintext Attack (IND-CPA)

A crypto game which has little to do with actually decrypting a message

80 of 163

Chosen Plaintext Attack (IND-CPA)

Adversary (Attacker)

Challenger (Benign)

“Hello”, “World”

E_k(“Hello”) OR E_k(“World”)

Choose a message

E_k(M)

polynomially many

Adversary wins if they guess correctly with probability > ½

You encrypted: “Hello”/“World”

81 of 163

ECB

Why is this bad?

82 of 163

ECB

If P1 = P2, C1 = C2!

Ask challenger to re-encrypt both messages and see which one matches!

83 of 163

AES

Design

Patterns

84 of 163

OTP-like

Examples: OFB, CFB, CTR

Properties:

Plaintext does not need to be padded

Use encryption block on decrypt

Decrypt looks similar to encrypt

*On non OTP-like schemes,

decrypt looks like a mirror image

of encrypt.

Identification:

Plaintext xored with AES output

85 of 163

CTR

Identification: Plaintext xored with AES output

Properties:

Plaintext does not need to be padded

Use encryption block on decrypt

Decrypt looks similar to encrypt

86 of 163

Feedback

Examples: OFB, CFB, CBC

Properties:

Plaintext feedback:

Decryption not parallelizable

Ciphertext/output feedback:

Encryption not parallelizable

Identification:

Arrow from one block to the next.

87 of 163

CBC

Identification:

Arrow from one block to the next.

Properties:

Plaintext feedback:

Decryption not parallelizable

Ciphertext/output feedback:

Encryption not parallelizable

88 of 163

IV reuse litmus test

Three source of entropy: key, IV, plaintext

Default:

OTP reuse on entire ciphertext.

Plaintext-incorporated (plaintext or ciphertext) feedback:

OTP reuse only up to but NOT including block containing first difference.

OTP-like:

+ OTP reuse on block containing first difference.

89 of 163

Asymmetric Cryptography

90 of 163

Diffie-Hellman Key Exchange

Semantically secure: Like IND-CPA except for public key crypto
g (some group generator) and p (some prime) are public

KeyGen:

Alice randomly samples a, and sends g^a mod p to Bob
Bob randomly samples b, and sends g^b mod p to Alice
Both can compute g^ab mod p, but no one else can!

How can they do this? Why can no one else?

91 of 163

Public parameters g and p

The secrets a and b

Disguise your secret with the common paint.

g^a mod p, g^b mod p

Add in the missing piece.

(g^a)^b mod p

(g^b)^a mod p

g^ab mod p

Swap disguised secrets.

92 of 163

MITM Diffie-Hellman

Diffie-Hellman does nothing to stop man-in-the-middle.

Definition: man-in-the-middle is an attacker who sees all of the communication between Alice and Bob and can change the messages between Alice and Bob.

How does Alice know she is talking to Bob? No way to tell.
How does Bob know he is talking to Alice? No way to tell.

93 of 163

ElGamal

Like Diffie-Hellman, except one side is going to sleep through it.
g and p public parameters as before.�
Bob: Generate private key b, public key g^b mod p. Goes to sleep.
Alice: Generates private key r, public key g^r mod p.
Alice: Encodes message with secret key: m x (g^b)^r mod p.
Bob: Wakes up. Decodes with m x (g^b)^r x (g^r)^(-b) mod p.

94 of 163

Hashing / PRNG

95 of 163

Properties of Hash Functions

One-Way Function

Easy to find H(x)=y given x. Hard to find x given y=H(x).

Second Pre-Image Resistant

Given x and H(x), it is hard to find x’ such that x != x’ and H(x) = H(x’)

Collision Resistant

Hard to find ANY x and x’ such that H(x) = H(x’)

96 of 163

Pseudorandom Number Generators

Deterministic way of generating bits which look random (pseudo-random) given some actually random bits

Initialize(entropy): Given some truly random bits "entropy" initializes the internal state of the RNG

Generate(n): Generate n pseudorandom bits

Reseed(entropy): Add more entropy to the PRNG

97 of 163

Properties of pRNG

Unpredictability
Rollback resistance

If attacker knows state of the pRNG (key and counter) at time T, they shouldn’t be able to find out the state at time T-1

98 of 163

End of cryptography.

Web Security

103 of 163

What can you do with Javascript?

Change HTML content, images, style of elements, hide elements, unhide elements, change cursor, read and change cookies...

Read cookie with JS:

var x = document.cookie;

Change cookie with JS:

document.cookie = "username=John Smith; expires=Thu, 18 Dec 2013 12:00:00 UTC; path=/";

104 of 163

Outer page can specify only sizing and placement of the frame in the outer page.

We can load frames from different websites!

Frame isolation: Outer page cannot change contents of inner page. Inner page cannot change contents of outer page.

108 of 163

This is an important exception to the same-origin policy!

110 of 163

Can Access?

Originating	Trying to Access
http://wikipedia.org/a/	http://wikipedia.org/b/
http://wikipedia.org/	http://www.wikipedia.org/
http://wikipedia.org/	https://wikipedia.org/
http://wikipedia.org:81/	http://wikipedia.org:82/
http://wikipedia.org:81/	http://wikipedia.org/

111 of 163

Can Access?

Originating	Trying to Access
http://wikipedia.org/a/	http://wikipedia.org/b/
http://wikipedia.org/	http://www.wikipedia.org/
http://wikipedia.org/	https://wikipedia.org/
http://wikipedia.org:81/	http://wikipedia.org:82/
http://wikipedia.org:81/	http://wikipedia.org/

113 of 163

Session management is like wristbands...

First entrance:

present ticket, ID
ticket guy hands you a wristband

Later entrances:

show your wristband to get in

First request:

send username, password
server gives you a session token

Later requests:

send the session token with your request

114 of 163

Cookies are also like wristbands...

The server sets cookies according to cookie policy

Like distributing wristbands�

The browser sends cookies according to cookie policy

Like showing a wristband at the entrance

115 of 163

URL breakdown

https://box.unicornbox.com/files/evanbot/doodle-ai.go

116 of 163

URL breakdown

https://box.unicornbox.com/files/evanbot/doodle-ai.go

When breaking down a URL, make a split at the double-slash...

then another split at the first slash after that.

117 of 163

URL breakdown

https://box.unicornbox.com/files/evanbot/doodle-ai.go

Part 1: Protocol

Look for it before the double slash
Tells us if we use HTTP or HTTPS

118 of 163

URL breakdown

https://box.unicornbox.com/files/evanbot/doodle-ai.go

Part 2: Domain

Look for it between the double-slash and the first single slash
Tells us which web server to contact (using DNS)

119 of 163

URL breakdown

https://box.unicornbox.com/files/evanbot/doodle-ai.go

Part 3: Path

Everything after the first single slash
Tells the web server what to show you

120 of 163

Cookie policy

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

Use setting rules if the server is setting the cookie (handing out a wristband).

Use sending rules if the browser is sending the cookie (showing a wristband).

Use domain rules on the domain part of the URL.

Use path rules on the path part of the URL.

121 of 163

Cookie policy rule 1

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

This rule is for servers setting cookies (handing out wristbands).

This rule applies to the domain part of the URL.

X can set a cookie with domain Y if X ends with Y and Y is not a TLD.

X = box.unicornbox.com

Y = admin.box.unicornbox.com

X ends in Y? False. Cookie is not set.

122 of 163

Cookie policy rule 1

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

This rule is for servers setting cookies (handing out wristbands).

This rule applies to the domain part of the URL.

X can set a cookie with domain Y if X ends with Y and Y is not a TLD.

X = box.unicornbox.com

Y = box.unicornbox.com

X ends in Y? True. Cookie is set.

123 of 163

Cookie policy rule 1

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

This rule is for servers setting cookies (handing out wristbands).

This rule applies to the domain part of the URL.

X can set a cookie with domain Y if X ends with Y and Y is not a TLD.

X = box.unicornbox.com

Y = evanbot.unicornbox.com

X ends in Y? False. Cookie is not set.

124 of 163

Cookie policy rule 1

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

This rule is for servers setting cookies (handing out wristbands).

This rule applies to the domain part of the URL.

X can set a cookie with domain Y if X ends with Y and Y is not a TLD.

X = box.unicornbox.com

Y = unicornbox.com

X ends in Y? True. Cookie is set.

125 of 163

Cookie policy rule 1

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

This rule is for servers setting cookies (handing out wristbands).

This rule applies to the domain part of the URL.

X can set a cookie with domain Y if X ends with Y and Y is not a TLD.

X = box.unicornbox.com

Y = cornbox.com

X ends in Y? False. Cookie is not set.

Note that “ends with” is domain matching, not string matching. Everything between the dots must match.

126 of 163

Cookie policy rule 1

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

This rule is for servers setting cookies (handing out wristbands).

This rule applies to the domain part of the URL.

X can set a cookie with domain Y if X ends with Y and Y is not a TLD.

X = box.unicornbox.com

Y =

X ends in Y? True. Cookie is set.

127 of 163

Cookie policy rule 1

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

This rule is for servers setting cookies (handing out wristbands).

This rule applies to the domain part of the URL.

X can set a cookie with domain Y if X ends with Y and Y is not a TLD.

X = box.unicornbox.com

Y = .com

X ends in Y? True. But cookie is not set due to exception.

TLDs include: .com, .org, .edu.

128 of 163

Cookie policy rule 2

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

This rule is for servers setting cookies (handing out wristbands).

This rule applies to the path part of the URL.

X can set a cookie with path Y always.

X = box.unicornbox.com

Y = /files

Always? Always. Cookie is set.

129 of 163

Cookie policy rule 2

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

This rule is for servers setting cookies (handing out wristbands).

This rule applies to the path part of the URL.

X can set a cookie with path Y always.

X = box.unicornbox.com

Y = /

Always? Always. Cookie is set.

130 of 163

Cookie policy rule 3

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

This rule is for browsers sending cookies (showing wristbands).

This rule applies to the domain part of the URL.

Browser sends X a cookie with domain Y if X ends with Y.

X = https://box.unicornbox.com/files/evanbot/doodle-ai.go

Y = unicornbox.com

X ends in Y? True. Cookie is sent.

131 of 163

Cookie policy rule 4

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

This rule is for browsers sending cookies (showing wristbands).

This rule applies to the path part of the URL.

Browser sends X a cookie with path Y if X begins with Y.

X = https://box.unicornbox.com/files/evanbot/doodle-ai.go

Y = /files/evanbot

X begins with Y? True. Cookie is sent.

132 of 163

Cookie policy rule 3-4 shortcut

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

To determine if a cookie is sent, I like to use this matching trick combining rules 3-4.

https://box.unicornbox.com/files/evanbot/doodle-ai.go

unicornbox.com/files/evanbot

(domain)

Concatenate the cookie domain and path. Line it up below the requested URL at the first single slash.

(path)

133 of 163

Cookie policy rule 3-4 shortcut

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

To determine if a cookie is sent, I like to use this matching trick combining rules 3-4.

https://box.unicornbox.com/files/evanbot/doodle-ai.go

unicornbox.com/files/evanbot

(domain)

Can you get everything to match? If yes, cookie is sent.

(path)

134 of 163

Cookie policy rule 3-4 shortcut

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

To determine if a cookie is sent, I like to use this matching trick combining rules 3-4.

https://box.unicornbox.com/files/evanbot/doodle-ai.go

unicornbox.com/files/admin

(domain)

Can you get everything to match? If no, cookie is not sent.

(path)

135 of 163

Cookie policy rule 3-4 shortcut

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

To determine if a cookie is sent, I like to use this matching trick combining rules 3-4.

https://box.unicornbox.com/files/evanbot/doodle-ai.go

cornbox.com/files/evanbot

(domain)

Can you get everything to match? If no, cookie is not sent.

(path)

136 of 163

Cookie policy: in summary

Rule 1: X can set a cookie with domain Y if X ends with Y and Y is not a TLD.
Rule 2: X can set a cookie with path Y always.
Rules 3-4: Write out domain+path underneath URL. If everything matches, the cookie gets sent to URL.

	Domain	Path
Setting	Rule 1	Rule 2
Sending	Rule 3	Rule 4

137 of 163

Cookies are name-value pairs.
There are cookie flags:

Domain: which domain this cookie is for (viz., www.google.com/foo/bar)
Path: which path this cookie is for (viz., www.google.com/foo/bar)
secure: indicates the browser should only send this cookie via HTTPS
HttpOnly: indicates the browser should stop accesses to this cookie through Javascript

Beware: Cookie policy =/= same-origin policy

138 of 163

Domain	Where Cookie Would Be SENT (if set)
(value omitted)	foo.example.com (exact)
bar.foo.example.com	?
foo.example.com	foo.example.com and *.foo.example.com
baz.example.com	?
example.com	?
ample.com	Cookie not set! Different domain.
.com	?

A web server on foo.example.com tries to set a cookie with the following domains. Will the cookie be set? If so, which domains will receive the cookie? (Some rows are filled in as examples.)

139 of 163

Domain	Where Cookie Would Be SENT (if set)
(value omitted)	foo.example.com (exact)
bar.foo.example.com	Cookie not set! Domain too specific
foo.example.com	foo.example.com and *.foo.example.com
baz.example.com	Cookie not set! Domain mismatch
example.com	example.com and *.example.com
ample.com	Cookie not set! Different domain.
.com	Cookie not set! Too broad.

A web server on foo.example.com tries to set a cookie with the following domains. Will the cookie be set? If so, which domains will receive the cookie? (Some rows are filled in as examples.)

140 of 163

SQL Injection

142 of 163

SQL injection step 1: what does the SQL interpreter see?

143 of 163

SQL injection step 1: what does the SQL interpreter see?

> INSERT INTO cart (session, item)� VALUES ('____________', '____________')

sessionToken item

144 of 163

> INSERT INTO cart (session, item)� VALUES ('____________', '____________')

sessionToken item

We want the SQL interpreter to see:�

> INSERT INTO cart (session, item)� VALUES ('123', 'toilet paper'), ('123', 'toilet paper'),

('123', 'toilet paper'), ... ('123', 'toilet paper')

sessionToken = 123

item = toilet paper'), ('123', 'toilet paper'),

('123', 'toilet paper'), ... ('123', 'toilet paper

150 of 163

Prepared statements: the previous attack no longer works

151 of 163

We can’t do injection in sessionToken

152 of 163

SQL injection step 1: what does the SQL interpreter see?

> INSERT INTO cart (session, item) VALUES ('____________', '____')

sessionToken item

> SELECT item FROM cart WHERE session = '____________' LIMIT 1

sessionToken

> SELECT link FROM items WHERE item = '_______'

item

The third query is the vulnerable one, but the first query is where the malicious input happens

SQL injection step 4: ??? step 5: profit

The SQL interpreter sees:

> SELECT link FROM items WHERE item = '_______'

item

We want the SQL interpreter to see:�

> SELECT link FROM items WHERE item = 'whatever';� INSERT INTO cart (session, item)� VALUES ('123', 'toilet paper'), ('123', 'toilet paper'),

('123', 'toilet paper'), ... ('123', 'toilet paper') --'

item = whatever';� INSERT INTO cart (session, item)� VALUES ('123', 'toilet paper'), ('123', 'toilet paper'),

('123', 'toilet paper'), ... ('123', 'toilet paper') --

159 of 163

Cross-Site Scripting (XSS)

Attacker inserts malicious script into another site's webpage that will be viewed by a user.

Script runs in user's browser and bypasses same-origin policy!

For example, if there was an XSS in Piazza, an attacker could make a user run Javascript as if Piazza had sent it

Since Javascript is very powerful, this can perform actions as the victim user

160 of 163

Two types of XSS

Stored XSS:

Attacker leaves Javascript lying around on a benign web service for victim to load
Think: Facebook Wall Post.

Attacker makes a post like <script>alert(1)</script>
IF the site does not escape HTML before showing it back to the user, then this could be a stored XSS.

Reflected XSS:

Attacker gets user to load a specially-crafted URL with Javascript encoded in the URL
Think: Google's search results

To search: google.com/?query=X
Outputs "No results for X."
Parts of URL appear in output
IF Google did not do the proper escaping, then this could be a reflected XSS!