Page 1 of 109

Earth Engine and Google Cloud Platform

Earth Engine User Summit, June 12th, 13th, & 14th, 2017

Matt Hancher, Co-Founder and Engineering Manager, Google Earth Engine


Ground Rules

A whirlwind tour with lots of material

Use these slides as a quick-reference later

Take a break with cute animals


Agenda

Introduction to Google Cloud Platform

Introduction to Google Cloud Storage

Command Line Tools: gcloud, gsutil, and earthengine

Exporting Data, Maps, and Map Tiles

Compute Engine and Container Engine

Other Cloud Platform Services

TensorFlow and Cloud ML Engine

Savannah the Fennec Fox. Image: Tom Thai



For the past 19 years, Google has been building out the fastest, most powerful, highest-quality cloud infrastructure on the planet.



Carbon Neutral Since 2007.

100% Renewable Energy in 2017.

Google datacenters use half the overhead energy of typical industry datacenters.

Measure Power Usage Effectiveness (PUE)

Adjust the Thermostat

Use Free Cooling

Manage Airflow

Optimize Power Distribution



Building what’s next





“Google is living a few years in the future and sending the rest of us messages” – Doug Cutting, Hadoop Co-Creator



15 Years of Tackling Big Data Problems

Timeline figure: Google research papers published from 2002 through 2015 (GFS, MapReduce, BigTable, FlumeJava, Dremel, Spanner, Millwheel, TensorFlow), the open-source ecosystem they inspired starting in 2005, and the Google Cloud products that grew out of them through 2016 (BigQuery, Pub/Sub, Dataflow, Bigtable, ML).


Google Cloud Data Platform

Storage and Databases: Cloud Storage, Cloud Datastore, Cloud Bigtable, Cloud SQL, Cloud Spanner

Big Data and Analytics: BigQuery, Cloud Dataflow, Cloud Dataproc, Cloud Pub/Sub, Cloud Datalab, Cloud Dataprep

Machine Learning: Cloud ML, Cloud Translate API, Cloud Vision API, Cloud Speech API


1 Billion Users


Map figure: current and future Cloud Platform regions, each with its number of zones (2 to 4). Regions shown: Oregon, California, Iowa, S Carolina, N Virginia, Montreal, São Paulo, London, Belgium, Netherlands, Frankfurt, Finland, Mumbai, Singapore, Taiwan, Tokyo, and Sydney.


The Google Network

  • More than 100 edge points of presence
  • More than 800 Google Global Cache edge nodes
  • Network sea cable investments


Jupiter Cluster Switch

  • 100,000 servers communicate at 10 Gb/s each
  • Equivalent to reading the entire Library of Congress in 1/10th of a second
  • Comparable to 40 million home high-speed internet connections

https://cloudplatform.googleblog.com/2015/06/A-Look-Inside-Googles-Data-Center-Networks.html


A datacenter is not a collection of computers,

a datacenter is a computer.


Titan

Google's purpose-built chip that establishes a hardware root of trust for both machines and peripherals in cloud infrastructure.

  • Securely identify and authenticate legitimate access at the hardware level
  • Part of Google's layered security architecture, spanning from the physical security layers up to the logical and operational security layers


The Evolution of Cloud Computing

Phase 1: Physical/Colo

Phase 2: Virtualized (Storage, Processing, Memory, Network)

Phase 3: Serverless (Storage, Processing, Memory, Network)


Typical Big Data Processing: effort is spread across analytics plus resource provisioning, performance tuning, monitoring, reliability, deployment & configuration, handling growing scale, and utilization improvements.

Big Data with Google: focus on insight, not infrastructure. Analytics gets your attention, for efficiency and productivity.


Our models, built on the results of validation with BigQuery customers, showed that organizations can expect to save between $881K and $2.7M over a three-year period by leveraging BigQuery instead of planning, deploying, testing, managing, and maintaining an on-premises Hadoop cluster.

– Enterprise Strategy Group (ESG) White Paper


Projects

All Cloud Platform resources that you allocate and use belong to a project.

A project is made up of the settings, permissions, billing info, and other metadata that describe your applications.

Each Cloud Platform project has:

  • A project name, e.g. “Example Project”.
  • A project ID, e.g. example-project.
  • A project number, e.g. 123456789012.

Resources within a project can work together easily.


Regions and Zones

Each data center is in a global region, such as Central US, Western Europe, or East Asia.

Each region is a collection of zones, which are isolated from each other within the region. For example, zone a in the East Asia region is named asia-east1-a.

This distribution of resources provides:

  • Redundancy in case of failure
  • Reduced latency, by locating resources closer to clients

Note: If you use higher-level Cloud Platform services then you do not need to care!


Google Cloud Platform Console


Google Cloud Shell

https://cloud.google.com/shell/


Google Cloud Platform Pricing Calculator


Geo for Good Cloud Credits Program

Cloud credits are available for Geo for Good partners.

For nonprofit, research or public benefit partners in countries where Cloud Platform is available.

Credits will be applied to your developer console for use on any of the Google Cloud Platform products.

Fill out the application form.

(Link also available on summit website.)

Share your use cases with us!


Agenda

Introduction to Google Cloud Platform

Introduction to Google Cloud Storage

Command Line Tools: gcloud, gsutil, and earthengine

Exporting Data, Maps, and Map Tiles

Compute Engine and Container Engine

Other Cloud Platform Services

TensorFlow and Cloud ML Engine

Posing Sand Kitten. Image: Charles Barilleaux


Google Cloud Storage

Google Cloud Storage offers durable and highly available object storage (i.e. file storage) in the cloud, as well as static content serving.

Several storage types all use the same APIs and access methods.


Objects and Buckets

Files in Google Cloud Storage are called objects.

You store your objects in one or more buckets.

Buckets live in a single global namespace.

Cloud Storage URL: gs://my-bucket/path/to/my-object
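Working with these URLs in scripts often starts with splitting them apart. A minimal sketch in plain Python (the helper name and example URL are illustrative, not part of any Google SDK):

```python
def parse_gs_url(url):
    """Split a gs://bucket/path/to/object URL into (bucket, object path)."""
    prefix = "gs://"
    if not url.startswith(prefix):
        raise ValueError("not a Cloud Storage URL: %r" % url)
    bucket, _, obj = url[len(prefix):].partition("/")
    return bucket, obj

print(parse_gs_url("gs://my-bucket/path/to/my-object"))
# ('my-bucket', 'path/to/my-object')
```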


Cloud Storage Permissions

Objects can be either public or private.

You control object permissions using Access Control Lists (ACLs).

ACLs grant READER, WRITER, and OWNER permissions to one or more grantees.

You can set the default ACL for newly-created objects in a bucket.


Cloud Storage Console

A simple user interface to:

  • Create and manage buckets
  • Upload and manage objects


Cloud Storage Pricing

Four storage classes:

  • Multi-Regional: $0.026 per GB/month
  • Regional: $0.02 per GB/month
  • Nearline: $0.01 per GB/month
  • Coldline: $0.007 per GB/month

API queries:

  • Class A, writes and management operations: $0.05 per 10,000
  • Class B, basic read operations: $0.004 per 10,000

Network egress bandwidth varies by region and volume, $0.08–$0.23 per GB.

Free Usage Limits:

5 GB-months of Regional Storage

5,000 Class A operations

50,000 Class B operations

1 GB Egress to most destinations


A Pricing Case Study

Case Study: Serving map tiles for a global Landsat-derived layer.

Multi-Regional Storage: 100GB @ $2.60/month

Reads: 1M queries (≈30K page views) @ $1.00/month

Bandwidth: 10GB (distributed globally) @ $1.27/month

Total Cost: $4.87/month ($58.44/year)


Serving Static Content

Public objects are served directly over HTTPS:

https://storage.googleapis.com/my-bucket/path/to/my-object

Private objects can be accessed from a browser by logged-in users too,

but it is slower and involves a URL redirection:

https://console.cloud.google.com/m/cloudstorage/b/my-bucket/o/path/to/my-object
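When a script builds such links, special characters in the object path need percent-encoding. A small plain-Python sketch, with a hypothetical bucket and object name:

```python
from urllib.parse import quote

def public_url(bucket, obj):
    """Build the storage.googleapis.com URL for a public object,
    percent-encoding special characters in the object path."""
    return "https://storage.googleapis.com/%s/%s" % (bucket, quote(obj))

print(public_url("my-bucket", "path/to/my object.png"))
# https://storage.googleapis.com/my-bucket/path/to/my%20object.png
```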


Google Cloud CDN

The Cloud CDN content delivery network can cache and deliver content to your users globally, backed by VM instances or Cloud Storage via HTTP(S) Load Balancing.


Agenda

Introduction to Google Cloud Platform

Introduction to Google Cloud Storage

Command Line Tools: gcloud, gsutil, and earthengine

Exporting Data, Maps, and Map Tiles

Compute Engine and Container Engine

Other Cloud Platform Services

TensorFlow and Cloud ML Engine


The gsutil and gcloud Command Line Tools

gsutil

  • Copy data into and out of Cloud Storage.
  • Manage your Cloud Storage buckets, ACLs, etc.

gcloud

  • Create and manage virtual machines in Compute Engine.
  • Create and manage clusters in Dataproc and Container Engine.
  • Create and manage Cloud SQL databases.
  • ...and much more.

Both come with the Google Cloud SDK: https://cloud.google.com/sdk/docs/


The earthengine Command Line Tool

earthengine

  • Copy, move, and remove assets.
  • Upload images and tables from Cloud Storage.
  • View and modify asset ACLs.
  • Create folders and image collections.
  • Manage long-running batch tasks.

Comes with the Earth Engine Python SDK:

https://developers.google.com/earth-engine/python_install


Example: Loading a large tiled image into Earth Engine

Copy the data to the Google cloud quickly, in parallel using -m:

gsutil -m cp my_image/*.tif gs://my-bucket/my_image/

Upload the data into Earth Engine:

earthengine upload image --asset_id my_asset \

$(gsutil ls gs://my-bucket/my_image/)

(Note the $(), which expands the list of files as command line arguments.)


Manage assets and files

List assets and files with ls:

earthengine ls users/username/folder

gsutil ls gs://my-bucket/folder

Copy and move assets and files with cp and mv:

earthengine cp users/username/source users/username/destination

gsutil mv gs://my-bucket/source gs://my-bucket/destination

Remove assets and files with rm:

earthengine rm users/username/asset_id

gsutil rm gs://my-bucket/filename


Create Buckets, Folders, and Collections

Create a Cloud Storage Bucket:

gsutil mb gs://my-new-bucket

Create an Earth Engine folder:

earthengine create folder users/username/my-new-folder

Create an Earth Engine image collection:

earthengine create collection users/username/my-new-folder


Upload Images from Cloud Storage to Earth Engine

A simple image upload:

earthengine upload image --asset_id my_asset \

gs://my-bucket/my_file.tif

Control how Earth Engine builds its pyramid of reduced-resolution data:

--pyramiding_policy sample

(Options are mean, sample, mode, min, and max. The default is mean.)

Control how Earth Engine sets the image’s mask:

--nodata_value=255

--last_band_alpha
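As a rough illustration of what the pyramiding policies mean, here is how each one would reduce a single 2×2 block of parent-resolution pixels to one child pixel. This is plain Python with made-up pixel values; Earth Engine's actual pyramiding happens server-side:

```python
from statistics import mean, mode

block = [10, 10, 40, 255]  # one 2x2 block of parent-resolution pixel values

policies = {
    "mean": mean(block),  # average the four pixels (the default policy)
    "sample": block[0],   # keep a single sampled input pixel
    "mode": mode(block),  # most common value, sensible for class/category bands
    "min": min(block),
    "max": max(block),
}
print(policies)
# {'mean': 78.75, 'sample': 10, 'mode': 10, 'min': 10, 'max': 255}
```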


Upload Tables from Cloud Storage to Earth Engine

A simple table upload:

earthengine upload table --asset_id my_asset \

gs://my-bucket/my_file.shp

Shapefiles consist of multiple files: specify the URL to the main .shp file.

Earth Engine will automatically use sidecar files that have the same base filename but different extensions.
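That matching can be pictured with a short helper that derives the companion URLs from the main .shp URL. This is illustrative only: the extension list here is an assumption, and Earth Engine performs the lookup itself:

```python
def sidecar_urls(shp_url, extensions=(".dbf", ".shx", ".prj")):
    """Derive sidecar-file URLs that share the main .shp file's base name."""
    base = shp_url.rsplit(".", 1)[0]
    return [base + ext for ext in extensions]

print(sidecar_urls("gs://my-bucket/my_file.shp"))
# ['gs://my-bucket/my_file.dbf', 'gs://my-bucket/my_file.shx', 'gs://my-bucket/my_file.prj']
```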


Manage Image Metadata in Earth Engine

Set a metadata property on an image asset:

earthengine asset set -p name=value users/username/asset_id

Set the special start time property on an image asset:

earthengine asset set --time_start 1978-10-15T12:34:56 \

users/username/asset_id

(You can use the same flags to set properties when uploading an image!)
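Earth Engine stores the start time (the system:time_start property) as milliseconds since the Unix epoch. A small plain-Python sketch of the conversion, assuming the timestamp is UTC:

```python
from datetime import datetime, timezone

def to_epoch_ms(iso_string):
    """Convert an ISO-8601 timestamp (assumed UTC) to milliseconds since
    the Unix epoch, the unit Earth Engine uses for start times."""
    dt = datetime.fromisoformat(iso_string).replace(tzinfo=timezone.utc)
    return int(dt.timestamp() * 1000)

print(to_epoch_ms("1978-10-15T12:34:56"))  # 277302896000
```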

Dump information about an asset:

earthengine asset info users/username/asset_id


Manage Access Permissions

Access Control Lists (ACLs) are how you manage access permissions for private data.

Get an asset’s or object’s ACL with acl get:

earthengine acl get users/username/asset_id

gsutil acl get gs://my-bucket/path/to/my/file

Set a “public” (world-readable) or “private” ACL with acl set:

earthengine acl set public users/username/asset_id

gsutil acl set private gs://my-bucket/path/to/my/file


Manage Access Permissions (Part 2)

Copy an ACL from one asset to others with acl get and acl set:

gsutil acl get gs://my-bucket/source > my_acl

gsutil acl set my_acl gs://my-bucket/destination/*

Change an individual user’s access with acl ch:

gsutil acl ch -u user@domain.com:R gs://my-bucket/source

Use :W to grant write access, or -d to delete the user’s permissions.

Use the special AllUsers user to control whether all users can see your object.

(These all work the same way in earthengine, too.)


Manage Earth Engine Batch Tasks

List your recent batch tasks:

earthengine task list

Print more detailed info about a specific task:

earthengine task info TASK_ID

Cancel a task:

earthengine task cancel TASK_ID


Agenda

Introduction to Google Cloud Platform

Introduction to Google Cloud Storage

Command Line Tools: gcloud, gsutil, and earthengine

Exporting Data, Maps, and Map Tiles

Compute Engine and Container Engine

Other Cloud Platform Services

TensorFlow and Cloud ML Engine


Exporting Images

You can export images directly to Cloud Storage from the Code Editor.

// Export an image to Cloud Storage.

Export.image.toCloudStorage({

image: image,

description: 'myImageExport',

bucket: 'my-bucket',

fileNamePrefix: 'my_filename',

scale: 30,

region: geometry,

});

This will produce a file named gs://my-bucket/my_filename.tif, or if the image is too large it will be automatically split across multiple files with that prefix and extension.


Exporting Images

Or do it in Python.

from ee.batch import Export

# Export an image to Cloud Storage.

task = Export.image.toCloudStorage(

image=image,

description='myImageExport',

bucket='my-bucket',

fileNamePrefix='my_filename',

scale=30,

region=geometry,

)

task.start()

Note: In Python the region parameter does not accept as many forms as it does in JavaScript, and the same is true of some other Export parameters. We're working on it.


Exporting Tables

You can also export tables directly to Cloud Storage.

# Export a table to Cloud Storage.

task = Export.table.toCloudStorage(

collection=features,

description='myTableExport',

bucket='my-bucket',

fileNamePrefix='my_filename',

)

This will produce a file named gs://my-bucket/my_filename.csv.

In addition to CSV, you can also export GeoJSON, KML, or KMZ.

You can do this in either JavaScript (in the Code Editor) or Python.


Exporting Videos

And, you can also export videos directly to Cloud Storage.

# Export a video to Cloud Storage.

task = Export.video.toCloudStorage(

collection=images,

description='myVideoExport',

bucket='my-bucket',

dimensions=720,

framesPerSecond=12,

region=geometry,

)

This will produce a file named gs://my-bucket/myVideoExport.mp4.

You can do this in either JavaScript (in the Code Editor) or Python, too.


Exporting Maps and Map Tiles

Finally, you can export map tiles directly to Cloud Storage.

# Export an image to Cloud Storage.

task = Export.map.toCloudStorage(

image=image,

description='myMapExport',

bucket='my-bucket',

path='my_folder',

region=geometry,

maxZoom=5,

)

This will produce a folder named gs://my-bucket/my_folder/ containing map tiles and a simple HTML+JS viewer that uses the Google Maps API.


Simple Map Viewer (HTML+JS)

View your map tiles.

Share a link.

Embed in an IFRAME.


Simple Map Viewer (HTML+JS)

If you expect a lot of traffic, sign up for a Maps API key.

Or, use the code as a starting point for a custom app.


Map Tiles and index.html in Cloud Storage

Or, skip the Maps API app and use the map tiles directly.

Browse your Cloud Storage files at https://console.cloud.google.com/storage/browser


Map Tiles

The map tile path is: folder/Z/X/Y

Z: The zoom level. Level 0 is global, and each higher level is twice the resolution.

X, Y: The x and y positions of the tile within the zoom level. 0/0 is the upper left.

The map tiles are in the Google Maps Mercator projection, which is used by most web mapping applications.

If you specifically request PNG or JPG tiles then they will have a .png or .jpg extension.

By default they are a mix of PNG and JPG (a.k.a. “AUTO”) and have no file extension.
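The tile containing a given point can be computed with a few lines of standard Web Mercator math. This is a plain-Python sketch, independent of Earth Engine; my_folder is a placeholder for your export path:

```python
import math

def tile_for(lat_deg, lon_deg, zoom):
    """Return the (Z, X, Y) tile containing a point in the Google Maps
    Mercator tiling scheme, where tile 0/0 is the upper-left tile."""
    n = 2 ** zoom
    x = int((lon_deg + 180.0) / 360.0 * n)
    y = int((1.0 - math.asinh(math.tan(math.radians(lat_deg))) / math.pi) / 2.0 * n)
    return zoom, x, y

z, x, y = tile_for(0.0, 0.0, 1)
print("my_folder/%d/%d/%d" % (z, x, y))  # my_folder/1/1/1
```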


Map Tile Permissions

By default, the map tiles and index.html file are world readable.

You must be an OWNER of the bucket in order to use this mode.

If you specify writePublicTiles=false then the map tiles are written using the bucket’s default ACL instead. You need only be a WRITER to use this mode.

You can change the default ACL that will be applied to newly-written objects.

For example, make all objects world-readable by default, e.g. for web serving:

gsutil defacl ch -u AllUsers:R gs://my-bucket

The defacl command works just like the acl command, but changes the default ACL.


Agenda

Introduction to Google Cloud Platform

Introduction to Google Cloud Storage

Command Line Tools: gcloud, gsutil, and earthengine

Exporting Data, Maps, and Map Tiles

Compute Engine and Container Engine

Other Cloud Platform Services

TensorFlow and Cloud ML Engine


Google Compute Engine

  • Virtual machines in Google's advanced data centers.
  • Scale up from single instances to whatever you need, instantly.
  • Custom machine types let you pay for only what you need.
  • Long-running workloads are automatically discounted.
  • Our efficient infrastructure is powered entirely by renewable energy.


Compute Engine and Earth Engine

Two common reasons to use Compute Engine and Earth Engine together:

Run third-party binaries or legacy tools, or run computations that can't be expressed in the Earth Engine API, using data from the Earth Engine Catalog.

Run applications built with the EE Python API, such as custom-built web applications. (But also consider App Engine for this use case; it's often simpler.)

Data never has to leave the cloud. Use Cloud Storage as a staging area.


Get Started with Compute Engine

Compute Engine Quick Start:

https://cloud.google.com/compute/docs/quickstart-linux

Install the Earth Engine SDK:

sudo apt-get update

sudo apt-get install libffi-dev libssl-dev python-dev python-pip

sudo pip install cryptography google-api-python-client earthengine-api


Two Authentication Options

Use your ordinary Google account.

  • Great for experiments and semi-interactive processing.
  • Access, upload, and manage your private data in the usual way.
  • Easy to configure: Just run “earthengine authenticate” and follow along.
  • Be careful: This stores powerful credentials on your computer or VM!

Use a Service Account.

  • Isolates your automated systems from your personal account.
  • Also easy to configure, especially inside Google Cloud Platform.
  • You will need to whitelist your service account for EE and share data with it.
  • Caveat: Service Accounts currently cannot upload data to EE. (We're working on it.)


Using your Ordinary Google Account

After you create your Compute Engine instance, log in and authenticate.

Create your instance:

gcloud compute instances create my-instance --machine-type f1-micro --zone us-central1-a

Log into your instance via ssh:

gcloud compute ssh --zone us-central1-a my-instance

Now, logged into your instance, authenticate it to EE:

earthengine authenticate

It will give you a URL to log in via your browser. Copy/paste the code back into the shell.


Using your Ordinary Google Account

Now you can easily authenticate to Earth Engine in your scripts:

import ee

ee.Initialize()

That's it! Once you've logged in and authenticated, your credentials are stored locally on the VM and are used by default.


Using your Compute Engine Service Account

When you create your Compute Engine instance, add the appropriate scopes:

GCP_SCOPE=https://www.googleapis.com/auth/cloud-platform

EE_SCOPE=https://www.googleapis.com/auth/earthengine

gcloud compute instances create my-instance \

--machine-type f1-micro --scopes ${GCP_SCOPE},${EE_SCOPE}

Note: Today you can only create Compute Engine instances whose service account has access to Earth Engine using the gcloud tool, not via the Compute Engine web UI.


Using your Compute Engine Service Account

Now you can easily authenticate to Earth Engine in your scripts:

import ee

from oauth2client.client import GoogleCredentials

ee.Initialize(GoogleCredentials.get_application_default())

That's it! In a properly-configured VM you never have to worry about managing service account credentials.


Authorizing your Compute Engine Service Account

Email earthengine@google.com to authorize your service account for EE.

(You only need to do this once: all your Compute Engine instances can share the same Service Account. You can configure others to isolate apps from each other if you want.)

To find your Service Account id:

gcloud compute instances describe my-instance

...

serviceAccounts:

- email: 622754926664-compute@developer.gserviceaccount.com

...

Share any private assets you need with that account.


Compute Engine Pricing

You can choose standard machine sizes,

or you can configure a custom machine size.

Automatic discounts for sustained use.

Preemptible VMs are discounted to around 21% of the base rate!

Typical US prices for a few machine types:

Type            CPUs        Memory  Price / Hour  Sustained Price / Month
f1-micro        1 (shared)  0.60GB  $0.007        $3.88
n1-standard-1   1           3.75GB  $0.0475       $24.27
n1-standard-16  16          60GB    $0.76         $388.36

Free Usage Tier: 1 f1-micro VM instance


Google Container Engine

A powerful automated cluster manager for running clusters on Compute Engine.

Lets you set up a cluster in minutes, based on requirements you define (such as CPU and memory).

Built on Docker and the open-source Kubernetes system.

https://cloud.google.com/container-engine/docs/

https://console.cloud.google.com/kubernetes/


We launch over 2 billion containers per week.

Containers at Google



Agenda

Introduction to Google Cloud Platform

Introduction to Google Cloud Storage

Command Line Tools: gcloud, gsutil, and earthengine

Exporting Data, Maps, and Map Tiles

Compute Engine and Container Engine

Other Cloud Platform Services

TensorFlow and Cloud ML Engine


Cloud Dataflow

Cloud Dataflow is a unified programming model and a managed service for:

  • Scalable batch computation
  • Continuous streaming computation

Cloud Dataflow frees you from tasks like resource management and performance optimization.

Based on Google technologies Flume and MillWheel, respectively, and now open source as Apache Beam.

https://cloud.google.com/dataflow/docs/

https://console.cloud.google.com/dataflow/


The Dataflow Programming Model

A Java and Python environment for data transformation pipelines.


The Dataflow Programming Model

// Batch processing pipeline
Pipeline p = Pipeline.create();
p.begin()
    .apply(TextIO.Read.named("ReadLines")
        .from(options.getInputFile()))
    .apply(new CountWords())
    .apply(MapElements.via(new FormatAsTextFn()))
    .apply(TextIO.Write.named("WriteCounts")
        .to(options.getOutput()));
p.run();


The Dataflow Programming Model

// Batch processing pipeline
Pipeline p = Pipeline.create();
p.begin()
    .apply(TextIO.Read.from("gs://..."))
    .apply(ParDo.of(new ExtractTags()))
    .apply(Count.create())
    .apply(ParDo.of(new ExpandPrefixes()))
    .apply(Top.largestPerKey(3))
    .apply(TextIO.Write.to("gs://..."));
p.run();

// Stream processing pipeline
Pipeline p = Pipeline.create();
p.begin()
    .apply(PubsubIO.Read.from("input_topic"))
    .apply(Window.<Integer>by(FixedWindows.of(5, MINUTES)))
    .apply(ParDo.of(new ExtractTags()))
    .apply(Count.create())
    .apply(ParDo.of(new ExpandPrefixes()))
    .apply(Top.largestPerKey(3))
    .apply(PubsubIO.Write.to("output_topic"));
p.run();


Under the hood, Earth Engine batch jobs are built on the same technology as Cloud Dataflow.


Cloud Dataproc

A managed service offering:

  • Apache Spark
  • Apache Hadoop
  • Apache Pig
  • Apache Hive

Great for migrating existing open source computation pipelines into Google Cloud Platform with ease.

https://cloud.google.com/dataproc/docs/

https://console.cloud.google.com/dataproc/


Dataflow & Spark

Thinking about writing a totally custom processing pipeline?

Read the article, “Dataflow/Beam & Spark: A Programming Model Comparison”

https://cloud.google.com/dataflow/blog/dataflow-beam-and-spark-comparison


BigQuery

Google’s fully managed, petabyte scale, low cost data warehouse for tabular data analysis.

BigQuery is serverless: just upload your data and immediately begin issuing familiar SQL queries, with nothing to manage.

BigQuery is ridiculously fast at ripping through huge tables of data in parallel.

https://cloud.google.com/bigquery/docs/

https://bigquery.cloud.google.com/


Cloud SQL

A fully managed PostgreSQL and MySQL service.

Let Google manage your database so you can focus on your applications.

PostgreSQL support includes the PostGIS extension, providing best-in-class open-source spatial SQL.


Cloud Datalab

Cloud Datalab is a Python notebook interface that you can use to explore, analyze, transform and visualize data and build machine learning models.

Based on the open source Jupyter framework.

Access Earth Engine, BigQuery, Compute Engine, Container Engine, Cloud Storage, the Cloud Machine Learning API, and more, all in one friendly place.

https://cloud.google.com/datalab/docs/

https://datalab.cloud.google.com/


Cloud Datalab Concepts

Your notebooks run in a Compute Engine instance in Google Cloud Platform.

Includes an integrated git web client so you can access and manage your code.

Compute Engine is cheap and has free quota. You can minimize costs by stopping Cloud Datalab instances you aren't using.


Setting up the Earth Engine SDK

The easiest way to use Earth Engine from Datalab is to configure your Datalab VM to use the datalab-ee container.

datalab create --image-name gcr.io/earthengine-project/datalab-ee:latest my-datalab

The first time, open the /notebooks/docs-earthengine folder and run the authorize_notebook_server.ipynb notebook to authorize Earth Engine with your credentials.

See https://developers.google.com/earth-engine/python_install-datalab-gcp


Agenda

Introduction to Google Cloud Platform

Introduction to Google Cloud Storage

Command Line Tools: gcloud, gsutil, and earthengine

Exporting Data, Maps, and Map Tiles

Compute Engine and Container Engine

Other Cloud Platform Services

TensorFlow and Cloud ML Engine


Sharing our tools with people around the world

TensorFlow, released in Nov. 2015

The #1 repository for machine learning on GitHub


A brief look at TensorFlow

  • Operates over tensors: n-dimensional arrays
  • Uses a flow graph: a data flow computation framework
  • Train on CPUs, GPUs, TPUs, etc.
  • Run wherever you like (local, cloud, mobile)
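The flow-graph idea can be sketched in a few lines of plain Python, with no TensorFlow involved: nodes describe operations, and values only flow through the graph when it is run. Everything here (the Node class and its tiny op set) is illustrative:

```python
import math

class Node:
    """A node in a tiny data-flow graph: an operation plus its inputs."""
    def __init__(self, op, *inputs):
        self.op, self.inputs = op, inputs

    def run(self, feed):
        """Evaluate the graph rooted at this node against a feed dict."""
        if self.op == "input":
            return feed[self.inputs[0]]
        args = [node.run(feed) for node in self.inputs]
        return {"add": sum, "mul": math.prod}[self.op](args)

# Build the graph y = x * w + b, then run it with concrete values.
x, w, b = Node("input", "x"), Node("input", "w"), Node("input", "b")
y = Node("add", Node("mul", x, w), b)
print(y.run({"x": 3.0, "w": 2.0, "b": 1.0}))  # 7.0
```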


Artificial Intelligence: the science of making things smart

Machine Learning: building machines that can learn

Neural Network: a type of machine learning algorithm


It all started with cats, lots and lots of cats



A neural network is a function that can learn.
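That one-sentence definition fits in a few lines of plain Python: a one-weight "network" y = w·x, nudged by gradient descent until it discovers w ≈ 2 from example data. Purely illustrative:

```python
# Training data sampled from the function y = 2x, which the model must discover.
examples = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]

w = 0.0               # the single learnable parameter
learning_rate = 0.05
for _ in range(200):  # repeatedly nudge w to reduce the squared error
    for x, target in examples:
        error = w * x - target              # prediction error for this example
        w -= learning_rate * 2 * error * x  # gradient of (w*x - target)**2 w.r.t. w
print(round(w, 3))  # 2.0
```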


Keys to Successful Machine Learning

Large Datasets

Good Models

Lots of Computation


Machine Learning

is made for Cloud




Offerings across the spectrum

App Developer: ML APIs (Vision API, Speech API, Translate API, Natural Language API). Use pre-built models.

Data Scientist and ML Researcher: Cloud MLE. Build custom models, use or extend the OSS SDK, on scalable no-ops infrastructure.


Introducing Cloud Machine Learning Engine

  • Fully managed service
  • Train using a custom TensorFlow graph for any ML use cases with CPUs/GPUs
  • Training at scale to shorten dev cycle
  • Automatically maximize predictive accuracy with HyperTune
  • High throughput batch predictions
  • Low latency online predictions (Beta)
  • Integrated Datalab experience


A common configuration: capturing input

Cloud Pub/Sub: reliable, many-to-many, asynchronous messaging, capturing events, metrics, and so on.

Cloud Storage: powerful, simple, and cost-effective object storage for raw logs, files, assets, Google Analytics data, and so on.


A common configuration: process and transform

Cloud Pub/Sub (stream) and Cloud Storage (batch) feed into Cloud Dataflow: a data processing engine for batch and stream processing.


A common configuration: process and transform

Alongside Cloud Dataflow, Cloud Dataproc (managed Spark and Hadoop) can also process batch data from Cloud Storage.


A common configuration: analyze and store

Results flow from Cloud Dataflow and Cloud Dataproc into BigQuery (an extremely fast and cheap on-demand analytics engine) and Bigtable (a high-performance NoSQL database for large workloads).


A common configuration: learn and recommend

Cloud Machine Learning trains your own models at large scale on the data in BigQuery and Bigtable.


Earth Engine and TensorFlow Today

Similar graph-based programming model using Python client libraries.

Drive Earth Engine & TensorFlow together from Cloud Datalab:

Preprocess data in EE → Export → Training & inference in TF → Import → Post-process & visualize in EE


Accessing Earth Engine Data from Cloud Platform

The Goal:

Direct integration between Earth Engine and Cloud Machine Learning Engine.

(Or other data processing systems running in Cloud Platform!)

Step 1:

A new API for querying Earth Engine data directly from Cloud Platform.

No export required.

Now available in Early Access. Let us know if you have a good use case!


Thank you!

Madagascar Lemur