Jesse Robbins

photo_jesse_m.jpgJesse Robbins (@jesserobbins) is CEO of Opscode and a recognized expert in Infrastructure, Web Operations, and Emergency Management.

He serves as co-chair of the Velocity Web Performance & Operations Conference and contributes to the O’Reilly Radar. Prior to co-founding Opscode, he worked at Amazon.com with a title of “Master of Disaster” where he was responsible for Website Availability for every property bearing the Amazon brand.

Robbins is a volunteer Firefighter/EMT and Emergency Manager, and led a task force deployed in Operation Hurricane Katrina. His experiences in the fire service profoundly influence his efforts in technology, and he strives to distill his knowledge from these two worlds and apply it in service of both.

Jesse Robbins (@jesserobbins),Opscode首席执行官,业界广为认可的架构专家,Web运营专家,紧急事件管理专家。

Robbins还是Velocity Performance & Operations Conference的联合主席,也参与O'Reilly Radar。参与创建Opscode之前他为Amazon.com服务,职位是“灾难专家”,负责网站可用性从而维系包括Amazon品牌在内的一切资产。

Robbins是消防员志愿者/急诊医生和紧急事件管理人员,曾经领导了一支部署在卡特里娜飓风营救行动中的特遣分队。他在消防领域的经验深深地影响他在技术方面的工作,他努力从这两方面吸取知识然后再应用到两个领域中去。

CrisisCamps and the Pattern of Disaster Technology Innovation

Jesse Robbins @jesserobbins 2010-01-23


Over the past three years I have been working to bridge gaps between the tech community & traditional emergency management organizations.  I've focused on helping technologists adapt technologies to support humanitarian missions, often in response to a disaster.  

After Hurricane Katrina, Mikel Maron and I discovered a pattern for successful innovation during and after disasters.  Understanding this pattern is crucial to "Serving Those that Serve Others".

Pattern for DisasterTech Innovation 
1. Disaster
2. Ad-Hoc Adaptation 
3. Championship 
4. Iterative Improvement

crisiscamp-logo.png
There is an unprecedented amount of interest and attention in finding ways to help in Haiti & around the world.  The CrisisCamp & CrisisCommon projects are coordinating events and helping match organizations with needs to volunteers with skills.  I encourage you to participate, and volunteer your time, knowledge, and resources.  

Serve those that serve others.  You can make a difference now.

Upcoming Crisis Camps

Velocity 2010: Fast By Default

Jesse Robbins @jesserobbins 2009-11-24

We're entering our third year of Velocity, the Web Performance & Operations Conference.. Velocity 2010 will be June 22-24, 2010 in Santa Clara, CA. It's going to be another incredible year.

Steve & I have set a new theme this year, "Fast by Default".  We want the broader Velocity community & to adopt it as a shared mission & mantra. The reason for this is simple...

Fast isn't a Feature. Fast is a Requirement.

At Velocity earlier this year Marissa Meyer explained why performance mattered so much to Google. Then Eric Schurman (Bing & Velocity Program Committee member) and Jake Brutlag (Google Search) made history with a co-presentation on just how crucial performance is to revenue .

Phil Dixon of Shopzilla explained that a 5" second performance improvement increased their revenue by 7-12 percent":http://http://velocityconference.blip.tv/file/2290648/ while reducing hardware spend by 50%!!!

Fast means Client, Server, Infrastructure, Operations, & Organizations

Getting to Fast isn't just about any one part of the system. Browser & Client performance is crucial, and requires an equally fast server & infrastructure to support it. When load increases, infrastructure must scale quickly or performance suffers. The operational tools and processes for managing software & infrastructure must support rapid changes in a dynamic environment, and be backed by an organization & culture that embraces it.

We're Looking for Speakers - Submit your Proposal by January 11

One more thing… velocity-olc.png Quite a few people have asked us to have Velocity conferences more frequently & beyond the SF Bay Area, and so we're going to try something new. On December 8 we'll be running our first ever Velocity Online Conference. Past Velocity Conference participants get a 50% discount & get a 25% discount off Velocity 2010.

See the full schedule after the jump...

More on how web performance impacts revenue...

Jesse Robbins @jesserobbins 2009-10-01

At Velocity this year Microsoft, Google and Shopzilla each presented data on how web performance directly impacts revenue.

Their data showed that slow sites get fewer search queries per user, less revenue per visitor, fewer clicks, fewer searches, and lower search engine rankings. They found that in some cases even after site performance was improved users continued to interact as if it was slow. Bad experiences have a lasting influence on customer behavior.

What about smaller websites that aren't yet at this scale?

Alistair Croll and Sean Power, the authors of the new book Complete Web Monitoring, have continued this research for sites at smaller scale.

They used a Strangeloop Networks web acceleration appliance to optimize half the sessions to a smaller production website, tagging optimized and unoptimized visitors so they could be analyzed in Google Analytics. The Strangeloop device applies many of Steve Souders' performance rules to an existing site automatically (a kind of "Steve-in-a-Box" ;-).

The results of their analysis show how significant a reduction in page latency can be. In addition to reducing bounce rates, and increasing pages per visit & time on site, they found a 16.07% increase in conversion rates and a 5.50% increase in average order value.

conversion-rate-and-order-value.png

Check out the full post on the Watching Websites blog.

John Adams on Fixing Twitter: Improving the Performance and Scalability of the World's Most Popular Micro-blogging Site

Jesse Robbins @jesserobbins 2009-08-06

Twitter is suffering outages today as they fend off a Denial of Service attack, and so I thought it would be helpful to post John Adams’ exceptional Velocity session about Operations at Twitter.

Good luck today John & team… I know it’s going to be a long day!

Update: Apparently Facebook & Livejournal have had similar attacks today. Rich Miller from Data Center Knowledge reminds us that this is just the latest in a series of major attacks.

Jonathan Heiliger on Web Performance, Operations, and Culture

Jesse Robbins @jesserobbins 2009-06-24

We were honored to have Jonathan Heiliger, Facebook’s VP of Technology Operations, as our opening keynote speaker at Velocity. Jonathan is one of the most accomplished leaders in our field, and is a master of the craft.

Here is his keynote in its entirety:

Note: Other videos from Velocity are being posted to VelocityConference.blip.tv

Ignite! comes to San Jose June 22nd - Submit your talks now!

Jesse Robbins @jesserobbins 2009-06-08

Ignite! VelocityIgnite! is coming to San Jose on Monday June 22, 2009 at 8:00 pm, attached to the Velocity Conference. Admission is free, open to all, and there will be a cash bar.

The deadline for talks is May 11th, so submit your talks now!

As with all Ignites each speaker will only get 20 slides that each auto-advance every 15 seconds for a total of five minutes. We'll be looking for fun geek topics like hacks, how-to's, and insights. (Talks don't have to be Velocity-related!) If you're not sure what an Ignite talk looks like check out the Ignite Show.

You can RSVP for the event on Upcoming or Facebook.

CrisisCamp is June 12-14th in Washington, DC

Jesse Robbins @jesserobbins 2009-06-07

crisiscamp.png CrisisCamp is an unconference to bring together domain experts, hackers, makers, developers, and first responders to improve technology and practice for humanitarian crisis management and disaster relief. This is the first event in what I hope will become a movement, and it's happening on June 12 - 14, 2009 in Washington, DC.

Across the world, everyday people can find themselves in crisis. Whether for a day, a month or a continued state of social distress, citizens across the world have common needs for communication. We want to connect with our loved ones to let them know that we are okay (or that we need help) and we crave information by which we make decisions within that time or place of crisis. We want to let people know they they are not alone, that people across the world care and seek to act act altruistically to provide resources to aid in the crisis recovery.

CrisisCamp seeks to break down the bifurcation between international and domestic humanitarian relief agencies and unite their efforts to share lessons learned, response tools, and expertise to encourage citizen engagement and use of information communications technologies (ICTs) to aid in crisis recovery, wherever people need help. We have great hopes that with a successful CrisisCamp, we can inspire a global effort to mitigate the potential impact of times or places of crisis on the human condition.

Crisis Camp seeks participation by anyone who wishes to help. We are especially interested in the use of ICTs in developing countries, especially in the areas of access, usability, and innovation. We seek to learn from academic findings on citizen participation, needs and problem solving efforts. In addition, we seek to understand global information needs through a consumer approach, because people will use what is available and familiar if a crisis event occurs. And finally, we want to know how ICTs, in all their uses, can help citizens of all abilities, recover during a time or place of crisis.

CrisisCamp Ignite! Session Kick Off

Time: Friday, June 12, 2009 from 7:30-9PM
Location: The World Bank,1818 H St., NW Washington D.C.
Participate: Sign up at CrisisCampIgnite (separate registration required for World Bank entry)

CrisisCamp - Saturday, June 13 & Sunday, June 14th

Start Time: 9:00am both days
Location: The Institute for Politics Democracy & the Internet @ George Washington University Participate: Sign up at CrisisCamp

Time Lapse of Galactic Center of Milky Way rising over Texas Star Party

Jesse Robbins @jesserobbins 2009-05-21

Galactic Center of Milky Way Rises over Texas Star Party from William Castleman.

According to William Castleman: The time-lapse sequence was taken with the simplest equipment that I brought to the star party. I put the Canon EOS-5D (AA screen modified to record hydrogen alpha at 656 nm) with an EF 15mm f/2.8 lens on a weighted tripod. Exposures were 20 seconds at f/2.8 ISO 1600 followed by 40 second interval. Exposures were controlled by an interval timer shutter release (Canon TC80N3). Power was provided by a Hutech EOS203 12v power adapter run off a 12v deep cycle battery. Large jpg files shot in custom white balance were batch processed in Photoshop (levels, curves, contrast, Noise Ninja noise reduction, resize) and assembled in Quicktime Pro. Editing/assembly was with Sony Vegas Movie Studio 9.

[via the Primary Tentacle @ Laughing Squid]

Space Shuttle Atlantis during Solar Transit

Jesse Robbins @jesserobbins 2009-05-17

STS-125 Atlantis Solar Transit (200905120002HQ)

In this tightly cropped image, the NASA space shuttle Atlantis is seen in silhouette during solar transit, Tuesday, May 12, 2009, from Florida. This image was made before Atlantis and the crew of STS-125 had grappled the Hubble Space Telescope. Photo Credit: (NASA/Thierry Legault)

Thierry made this image using a solar-filtered Takahashi 5-inch refracting telescope and a Canon 5D Mark II digital camera. Photo Credit: (NASA/Thierry Legault)

You can see more of Thierry's fine work at: www.astrophoto.fr

from nasa hq photostream [via slashdot]

Velocity 2009 - Big Ideas (early registration deadline)

Jesse Robbins @jesserobbins 2009-05-08

what-is-velocityconf.png

(tag cloud created from Velocity session & speaker information using wordle.net)

My favorite interview question to ask candidates is: "What happens when you type www.(amazon|google|yahoo).com in your browser and press return?"

While the actual process of serving and rendering a page takes seconds to complete, describing it in real detail can take an hour. A good answer spans every part of the Internet from the client browser & operating system, DNS, through the network, to load balancers, servers, services, storage, down to the operating system & hardware, and all the way back again to the browser. It requires an understanding of TCP/IP, HTTP, & SSL deep enough to describe how connections are managed, how load-balancers work, and how certificates are exchanged and validated... and that's just the first request!

Web Performance & Operations is an emerging discipline which requires incredible breadth, focusing less on specific technologies and more on how the entire system works together. While people often specialize on particular components, great engineers always think of that component in relation to the whole. The best engineers are able to fly to the 50,000 foot view and see the entire system in motion and then zoom in to microscopic levels and examine the tiny movements of an individual part.

John Allspaw recently described this interconnectedness on his blog:

With websites, the introduction of change (for example, a bad database query) can affect (in a bad way) the entire system, not just the component(s) that saw the change. Adding handfuls of milliseconds to a query that’s made often, and you’re now holding page requests up longer. The same thing applies to optimizations as well. Break that [bad] query into two small fast ones, and watch how usage can change all over the system pretty quickly. Databases respond a bit faster, pages get built quicker, which means users click on more links, etc. This second-order effect of optimization is probably pretty familiar to those of us running sites of decent scale.

Working with these systems requires an understanding not only of the way technology interacts, but the way that people do as well. The structure, operation, and development of a website mirrors the organization that creates it, which is why so many people in WebOps focus on understanding and improving management culture & process.

Organizing a conference like Velocity is a wonderful challenge because it requires the same sort of thinking. We focus on the big concepts that everyone needs to know and then go deep into the technologies that change our understanding of the system. We find ways to share the unique experience that can only be gained by operating at scale. We make it safe to share as much of the "Secret Sauce" as we can.

Please join us at Velocity this year, we have an amazing lineup of speakers & participants. Early registration ends on Monday, May 11th at 11:59 PM Pacific. (Radar readers can use "vel09cmb" for an additional 15% discount.)

Velocity, the Web Performance and Operations Conference 2009

Importance of Innovation in Finance & BarCampBank

Jesse Robbins 2009-04-20

“Progress is not the mere correction of evils. Progress is the constant replacing of the best there is with something still better.” -Edward Filene

logobarcampbank.pngTwo years ago, when we were organizing the first BarCampBank in the US, many people found it hard to believe that banks & credit unions could a place for meaningful grassroots innovation. Even crazier was the idea of organizing an unconference to begin bringing open source, transparency, identity, and community into the very closed world of banking & finance.

Since then the BarCampBank idea has turned into a movement. There have been over 14 events all over the world, and many of the ideas generated are beginning to turn into action.

To me, the global financial system is a platform must always create more value than it captures. Tim explained this in his Work on Stuff that Matters post, saying:

“A bank that loans money to a small business sees that business grow, perhaps borrow more money, hire employees who make deposits and take out loans, and so on. The power of this cycle to lift people out of poverty has been demonstrated by microfinance institutions like the Grameen Bank. Grameen is clearly focused on creating more value than they capture; not so the like of Fannie Mae and Freddy Mac, or WaMu, or many of the other failed financial institutions involved in the current financial meltdown.”

There has never been a more important time to bring meaningful innovation into the financial system, and there has never been more opportunity for our community to make it happen.

The next event is occurring this weekend (April 25-26, 2009) on Treasure Island in San Francisco.

sfbarcamplogo-med.jpg After that, the following events are planned:

Data Center Power Efficiency(数据中心的电力效率)

Jesse Robbins 2008-11-29

James Hamilton is one of the smartest and most accomplished engineers I know. He now leads Microsoft's Data Center Futures Team, and has been pushing the opportunities in data center efficiency and internet scale services both inside & outside Microsoft. His most recent post explores misconceptions about the Cost of Power in Large-Scale Data Centers:

jameshamilton.jpg

I’m not sure how many times I’ve read or been told that power is the number one cost in a modern mega-data center, but it has been a frequent refrain. And, like many stories that get told and retold, there is an element of truth to the it. Power is absolutely the fastest growing operational costs of a high-scale service. Except for server hardware costs, power and costs functionally related to power usually do dominate.

However, it turns out that power alone itself isn’t anywhere close to the most significant a cost. Let’s look at this more deeply. If you amortize power distribution and cooling systems infrastructure over 15 years and amortize server costs over 3 years, you can get a fair comparative picture of how server costs compare to infrastructure (power distribution and cooling). But how to compare the capital costs of server, and power and cooling infrastructure with that monthly bill for power?

The approach I took is to convert everything into a monthly charge. [...]

James Hamilton explains Datacenter Costs
[link]

翻译:xiaochong

James Hamilton是我知道的最聪明最牛的工程师之一。他现在领导微软Data Center Futures团队,一直在微软内部和外部推动数据中心效率和互联网规模服务的各种机遇。下面是他最新的文章,指出了人们关于大规模数据中心电力成本方面的一些误解:

我无数次听到大家讲在现代化大型数据中心中电力成本是最大的一部分成本。事实未必如此。电力成本无疑是增长最快的运营成本。除了服务器硬件成本之外电力成本以及电力相关成本通常是最大的一块。

然而实际上电力单独一部分的成本并不是总成本中最大的。让我们来看一下。将配电系统和制冷系统这些基础设施的成本分摊到15年中,将服务器成本分摊至3年中,这样就可以很好地比较服务器成本和基础设施(配电和制冷)成本了。但是如何比较服务器、电力以及制冷基础设施的资本成本和每月电费的关系?

我的办法是把一切都分摊到每个月中。

Velocity 2009: Themes, ideas, and call for participation...

Jesse Robbins 2008-11-21

velocity2009_120x421.gifLast year's Velocity conference was an incredible success. We expected around 400 people and we ended up maxing out the facility with over 600. This year we're moving the conference to a bigger space and extending it to 3 days to accommodate workshops and longer sessions. Velocity 2009 will be on June 22-24th, 2009 at the Fairmont Hotel in San Jose, CA. This year's conference will be especially important. I've said many times that Web Performance and Operations is critical to the success of every company that depends on the web. In the current economic situation, it's becoming a matter of survival. The competitive advantage comes from the ability to do two things:
  1. Generate more revenue with fewer resources
  2. Respond quickly to change
Our Velocity 2009 mantra is "Fast, Scalable, Efficient, Available", a slight change from last year. (We've replaced "Resilient" with "Efficient" to make focus clear.)

I'm excited to announce that joining Steve Souders & I on this year's program committee are John Allspaw, Artur Bergman, Scott Ruthfield, Eric Schurman, and Mandi Walls.  We've already started working on the program, and have just opened the Call for Participation.

We're especially interested in the following topics:

  • How to tie web performance and operations to the bottom line
  • Real-world incident management - getting “tight like a pit crew”
  • Making websites as fast and reliable as desktop apps
  • Networking, DNS, and load balancing
  • Profiling everywhere: JavaScript, CSS, and the network
  • Managing web services - flaming disasters you survived and lessons learned
  • The intersection between performance and design
  • Wicked cool (and actionable) metrics
  • Ads, ads, ads - the performance killer?
  • Troubleshooting in production
  • How to scale and be fast on the social web
  • Capacity planning and load testing
  • Establishing performance and operations best practices within your organization
  • Configuration management best (and worst) tools and practices
  • Monitoring and instrumentation experiences: Open Source, as a service, commercially supported solutions
  • Using multiple CDNs to improve customer experience and reduce cost
The submission deadline is January 5th, so get your talks in.  If you have any questions or suggestions for the committee, send them to velocity-idea@oreilly.com.

Major milestone for ProgrammableWeb & "The Web as Platform"

Jesse Robbins 2008-11-03

200811031544.jpg Last week marked an important milestone for the "Web as Platform" as the 1,000 API was added to the ProgrammableWeb registry. John Musser (see: Web2.0 Report) started tracking the first few web service API's back in 2005.

How do these 1000 APIs break down by type? The following chart, derived from our database, shows the the top 15 sectors or markets with the greatest number of competing API providers. As you can see there are already 71 mapping-related APIs alone"

200811031528.jpg

Congratulations!

DisasterTech: "Decisions for Heros"

Jesse Robbins 2008-11-01

One of the most interesting DisasterTech projects I've been following is "Decisions for Heroes" led by developer and Irish Coast Guard volunteer Robin Blandford.

Decisions is like Basecamp for volunteer Search & Rescue teams. The focus is on providing "just enough" process to compliment the real-world workflow of a rescue team, without unnecessary complexity. One of Robin's design goals is that: decisions-for-heros.png

User requirements are nil. Nobody likes reading manuals - if we have to write one, we've gotten too complicated.

This is the winning approach for building systems that "serve those that serve others", and is echoed by InSTEDD's design philosophy and the Sahana disaster management system.

Teams begin by entering their responses to incidents and training exercises. They then tag them with things like the weather conditions, the tools and skills required, and who from the team was deployed.

As a team's incident database grows this information can be used to show heatmaps, and provide powerful insight on the locations, weather conditions, and times of year that various incidents occur. Over time this kind of data could be analyzed in aggregate across multiple teams and regions and create an incredibly powerful resource for Emergency Managers. This is very similar to what Wesabe does for consumers with financial transaction data today (disclosure: OATV investment).

200811011649.jpg

Rescue team members enter training dates and levels. The system tracks certification expiration dates and prompts team members & leaders to plan classes and remain current. This is a huge issue for volunteers who have to manage professional-level training requirements with the demands of a regular career.

As more incidents are entered into the system, it compares the skills required for each of the rescues with the team training exercises. This allows teams to identify areas to focus, train, and develop new skills.

200811011644.jpg

This is an innovative project with tremendous potential, and hopefully an early signal of coming changes in Emergency Management.

(Note: ''How to Serve those that Serve Others" will be the theme of my "High Order Bit" session at the Web2.0 Summit.  I'll be sure to post video/slides/notes when they are available.)

Sprint blocking Cogent network traffic...

Jesse Robbins 2008-10-31

It appears that Sprint has stopped routing traffic (called "depeering") from Cogent as a result of some sort of legal dispute. Sprint customers cannot reach Cogent customers, and vice versa. The effect is similar to what would happen if Sprint were to block voice phonecalls to AT&T customers.

Here's a graph that shows the outage, courtesy of Keynote :
sprint-cogent-routing-problems-keynote.png

Rich Miller at DataCenterKnowledge has a great summary of the issues behind the incident, which has happened with Cogent before. Rich says:

At the heart of it, peering disputes are really loud business negotiations, and angry customers can be used as leverage by either side. This one will end as they always do, with one side agreeing to pay up or manage their traffic differently.

I think this is particularly Radar-worthy because it provides an example of the complex issues around Net Neutrality.In this case customers are harmed and most (especially Sprint wireless customers) will have no immediate recourse.

Todd Underwood of Renesys has posted an incredibly detailed explanation the scope and impact of this issue. Here is a summary:

Another way to look at the scope of this event is to identify the number, size and ownership of the network prefixes affected by the outage. [...] So, in total, at least 3500 networks on the Internet have less than full connectivity right now. [...]

One might suspect that these single-homed autonomous systems are simply incautious or insignificant networks. After all, given the history of Internet partitions, who would be rash enough to have important network services located on a single-homed prefix in this day and age?

The following prefixes are some of the more interesting networks single-homed behind Sprint:

  • 208.95.96.0/21 Expedia, Inc.
  • 164.62.0.0/16 Federal Trade Commission
  • 204.108.8.0/24 Federal Aviation Administration
  • 198.9.201.0/24 National Aeronautics and Space Administration
  • 170.189.200.0/24 Occidental Petroleum Corporation
  • 148.168.0.0/16 Pfizer Inc.
  • 128.6.0.0/16 Rutgers University
  • 173.100.0.0/16 Sprint PCS (lots of networks here, of course)
  • 149.24.174.0/23 SUNGARD HIGHER EDUCATION INC.

And that is just a few.

The following prefixes are some of the more interesting networks single-homed behind Cogent:

  • 89.251.2.0/24 Joost Production Benelux Network
  • 72.5.224.0/24 Loopt, Inc.
  • 198.185.178.0/23 National Aeronautics and Space Administration and many more)
  • 204.201.48.0/21 NTT America, Inc. (and many more like it, from the T1 and hosting customers acquired from NTT/Verio)
  • 204.9.56.0/24 Skynet Access (this might actually be good news, if the loss of connectivity to Skynet prevents or delays sentience).
  • 142.155.0.0/16 St. Lawrence College
  • 128.100.0.0/16 University of Toronto (and a bunch of other colleges and universities)
[...] The point here is that this is a big deal. There are lots of significant organizations that appear to have lost connectivity due to this dispute.

These same kinds of issues will likely happen with cloud service providers as well. As we've already learned from the evolution of VoIP, you become what you disrupt.

Amazon's new EC2 SLA

Jesse Robbins 2008-10-24

Amazon announced a new SLA for EC2, similar to the one for S3. This is a notable step for Amazon and cloud computing as a whole, as it establishes a new bar for utility computing services.

Amazon is committing to 99.95% availability for the EC2 service on a yearly basis, which corresponds to approximately four hours and twenty three minutes of downtime per year. It's important to remember that an SLA is just a contract that provides a commitment to a certain level of performance and some form of compensation when a provider fails to meet it.

Here's the summary of the EC2 SLA (emphasis added):
Service Commitment AWS will use commercially reasonable efforts to make Amazon EC2 available with an Annual Uptime Percentage (defined below) of at least 99.95% during the Service Year. In the event Amazon EC2 does not meet the Annual Uptime Percentage commitment, you will be eligible to receive a Service Credit as described below. [...]
  • “Annual Uptime Percentage” is calculated by subtracting from 100% the percentage of 5 minute periods during the Service Year in which Amazon EC2 was in the state of “Region Unavailable.” If you have been using Amazon EC2 for less than 365 days, your Service Year is still the preceding 365 days but any days prior to your use of the service will be deemed to have had 100% Region Availability [...]
  • “Unavailable” means that all of your running instances have no external connectivity during a five minute period and you are unable to launch replacement instances. [...]
To receive a Service Credit, you must submit a request by sending an e-mail message to aws-sla-request @ amazon.com. To be eligible, the credit request must [...] include your server request logs that document the errors and corroborate your claimed outage (any confidential or sensitive information in these logs should be removed or replaced with asterisks)

This new SLA does not appear to address the reliability of server instances individually or in aggregate. For example, if half of a customer's EC2 instances lose their connections or die every 6 minutes, EC2 would still be considered "available" even if it is essentially unusable.

If the entire EC2 service is down a cumulative four hours and twenty minutes, customers must furnish proof of the outage to Amazon to be eligible for the 10% credit. This seems like an onerous process for very little compensation, and isn't in-line with Amazon's famous "Relentless Customer Obsession". Amazon takes monitoring very seriously and should take the lead by tracking, reporting, and proactively compensating customers when it lets them down.

Incredible images of the Sun

Jesse Robbins 2008-10-15

sol17.jpg
The Boston Globe has assembled a beautiful gallery of images of the Sun.
This LASCO C2 image, taken 8 January 2002, shows a widely spreading coronal mass ejection (CME) as it blasts more than a billion tons of matter out into space at millions of kilometers per hour. The C2 image was turned 90 degrees so that the blast seems to be pointing down. An EIT 304 Angstrom image from a different day was enlarged and superimposed on the C2 image so that it filled the occulting disk for effect (Courtesy of SOHO/LASCO consortium)

[link courtesy Barry Brumitt]

Apple's restrictions mean more jailbreaking & Android adoption(Apple的限制只能造成更多越狱并将大家推向Android)

Jesse Robbins 2008-09-24

When Apple announced the iPhone SDK last year I said:

[...] Jobs makes it clear that the platform won't be completely open. While he says that this is to balance the benefits of an open platform with user security protection, it's unclear where Apple will draw those lines. Will there be a Skype client? Third-party media apps?

It would have been better if Apple had announced [the details] when it released the iPhone. I'm hopeful that Apple will now embrace the existing iPhone developer community, and won't use “security” as a way to keep potential competitors off its platform.

Almost a year later Apple is using their control of the App store to block innovative developers from reaching their customers. The most recent example is the "Podcaster" iPhone app which allows you to download and manage podcasts on the iPhone directly, without having to boot your computer to sync in iTunes.

According to the developer, Apple blocked this application from the App store, saying:

Since Podcaster assists in the distribution of podcasts, it duplicates the functionality of the Podcast section of iTunes.

If you want to build a platform, you have to compete fairly with the developers on your platform (if you must to compete at all). By restricting developers, Apple is stifling innovation and their long-term growth. Frustrated customers and developers who "think different" are Jailbreaking their iPhones and getting excited about Google's Android.

Remember: Successful platforms create more value than they capture.

翻译:西门吹雪

去年Apple发布iPhone SDK时我写道

……乔布斯明确指出该平台不会完全开放。他称此举是在平台开放性和用户安全保护之间取得平衡,不清楚Apple会在平衡到什么程度。会产生Skype客户端?或者第三方媒体应用?

Apple发布iPhone时宣布详细信息会更好。我希望Apple能拥抱已经存在的iPhone开发社区,而不是用“安全”借口将潜在的竞争者隔离在平台之外。

Apple采用App Store控制将革新开发人员与客户隔离开来快一年了。最新的例子就是“Podcaster”,这个应用允许用户直接在iPhone上下载并管理播客,而无需用计算机通过iTunes来同步。

据该开发人员称App Store将其拒之门外,理由是

Podcaster处理播客,与iTunes的播客功能重复。

如果你构建了一个平台就必须与该平台上的开发人员公平竞争(如果一定要竞争)。通过限制开发人员Apple阻碍了革新和自身的长远发展。失望的客户和开发人员有不同想法,只能越狱iPhone,并对Google的Android欢呼雀跃。

请记住:成功的平台要创造比他们索取到的更多的价值

Kaminsky DNS Patch Visualization

Jesse Robbins 2008-08-07

Dan Kaminsky has posted the details of the widespread DNS vulnerability. Clarified Networks created this visualization of DNS patch deployment over the past month:

Red = Unpatched
Yellow = Patched, "but NAT is screwing things up"
Green = OK

The new internet traffic spikes

Jesse Robbins 2008/06/28

Theo Schlossnagle, author of Scalable Internet Architectures, gave a great explanation of how internet traffic spikes are shifting:

Lately, I see more sudden eyeballs and what used to be an established trend seems to fall into a more chaotic pattern that is the aggregate of different spike signatures around a smooth curve. This graph is from two consecutive days where we have a beautiful comparison of a relatively uneventful day followed by long-exposure spike (nytimes.com) compounded by a short-exposure spike (digg.com):

The disturbing part is that this occurs even on larger sites now due to the sheer magnitude of eyeballs looking at today's already popular sites. Long story short, this makes planning a real bitch.

[...]What isn't entirely obvious in the above graphs? These spikes happen inside 60 seconds. The idea of provisioning more servers (virtual or not) is unrealistic. Even in a cloud computing system, getting new system images up and integrated in 60 seconds is pushing the envelope and that would assume a zero second response time. This means it is about time to adjust what our systems architecture should support. The old rule of 70% utilization accommodating an unexpected 40% increase in traffic is unraveling. At least eight times in the past month, we've experienced from 100% to 1000% sudden increases in traffic across many of our clients.

[Link]

user/jesse_robbins.txt · 最后更改: 2010/01/23 由 radarman
O'Reilly Home | O'Reilly Beijing | Ignite China(点燃之夜在中国) | Privacy Policy ©2005-2010, O'Reilly Media, Inc.
All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners.
京ICP备05003502号