I will be leaving Yahoo! at the end of this month to join Cloudera.
About five years ago I was working with Mike Cafarella on Apache Nutch, an open-source web-search engine. Initially we were able to crawl and index on four machines in parallel, but with a lot of manual steps. Inspired by two Google papers, we built a distributed filesystem and a MapReduce implementation that automated most of these steps. Operation became much simpler, and we were then able to easily run Nutch on twenty machines, with near-linear scaling.
But to scale to the many billions of pages on the web we’d need to be able to run it on thousands of machines. And the more we worked on it, the more I realized that this would take far more developers and resources than we had.
Yahoo! proposed to fill this gap. Eric Baldeschwieler led a team of talented folks, including Owen O’Malley, Sameer Paranjpye, and Nigel Daley. Eric said he’d dedicate his team to scaling this system to be able to process the full web. So, three and a half years ago, I joined Yahoo! to help make this happen.
We exceeded my dreams. First we moved the distributed computing code out of Nutch into a new Apache project christened Hadoop. Then we set out to improve scalability, performance, and reliability, all the while adding many features. After one year Hadoop was used daily by many research groups within Yahoo!. After two years it generated Yahoo!’s web search index, achieving web scale. Now, after three years, Hadoop holds the big-data sort record and the project has become a de facto industry standard for big-data computing, used by scores of companies. The recent Hadoop Summit was attended by over 750 people from around the world.
Many folks at Yahoo! were instrumental in this story, including: Raymie Stata, Dhruba Borthakur, Arun C Murthy, Devaraj Das, Raghu Angadi, Hairong Kuang, Konstantin Shvachko, Runping Qi, Chris Douglas, Allen Wittenauer, Sharad Agarwal and Hemanth Yamijala, to name just a few. Yahoo! deserves enormous and ongoing thanks for the key role it plays in making Hadoop useful.
Now Hadoop is a thriving open-source project, with large and diverse developer and user communities. Going forward, Cloudera presents an opportunity to work with a wider range of Hadoop users. I hope to help synthesize these many voices into a project that best serves all.
Hadoop has grown very quickly into a large, active project, but it is still a young one. At Cloudera I will be well positioned to help it mature. This move will not fundamentally change my day-to-day activities. I will continue to work on Hadoop, working closely with developers from Yahoo! and elsewhere to build great software.
August 10, 2009 at 12:08 pm |
[...] — that Doug Cutting, co-founder of the Apache Hadoop project and creator of Nutch and Lucene, has agreed to join Cloudera beginning on September 1, 2009. Doug’s contributions to Hadoop over the [...]
August 10, 2009 at 12:40 pm |
Congratulations Doug! :)
August 10, 2009 at 1:07 pm |
[...] http://blog.lucene.com/2009/08/10/joining-cloudera/ [...]
August 10, 2009 at 2:29 pm |
Congratulations, and good luck!
August 10, 2009 at 5:50 pm |
[...] bring us to the last news item: Doug Cutting is leaving Yahoo for Cloudera, where he’ll continue to work on Hadoop. According to his blog post about it, [...]
August 10, 2009 at 6:27 pm |
[...] his blog post explaining the move, Cutting specifically states that he joined Yahoo in order to get the resources [...]
August 10, 2009 at 6:32 pm |
Good luck and keep up the good work!
August 10, 2009 at 9:55 pm |
Congratulations Doug… excellent decision :)
August 10, 2009 at 11:01 pm |
[...] his personal blog, Cutting says he’ll be doing much the same work with Cloudera that he was doing at Yahoo, and will continue [...]
August 11, 2009 at 4:10 am |
[...] his Hadoop project. (Picture from Facebook.) The timing was coincidental, he insisted. In a blog post he heaped fulsome praise on Yahoo and said that at Cloudera he will be "well-positioned to help it [...]
August 11, 2009 at 4:46 am |
Good Luck!!!!
August 11, 2009 at 5:49 am |
[...] Cutting, creator of open-source software framework Hadoop, has left Yahoo to join Cloudera, a Burlingame, Calif.-based startup that is commercializing Hadoop. The center of [...]
August 11, 2009 at 12:19 pm |
Just adding my congrats to the pile!
August 11, 2009 at 7:19 pm |
Congrats, Doug!
August 11, 2009 at 8:44 pm |
Congrats, Good luck.
August 11, 2009 at 11:04 pm |
[...] timing was coincidental, he insisted. In a blog post he heaped fulsome praise on Yahoo and said that at Cloudera he will be “well-positioned to [...]
August 12, 2009 at 11:53 am |
Good luck with Cloudera, glad to know you’ll still be involved in Hadoop.
August 12, 2009 at 2:22 pm |
[...] Doug Cutting is leaving Yahoo! and will be joining Cloudera. (Via Free Search) [...]
August 12, 2009 at 6:48 pm |
[...] Joining Cloudera « Free Search [...]
August 12, 2009 at 11:44 pm |
[...] – Doug Cutting leaves Yahoo! to go to Cloudera and continue developing [...]
August 13, 2009 at 12:02 am |
[...] Joining Cloudera [...]
August 13, 2009 at 11:03 pm |
[...] I almost forgot: he has left [...]
August 14, 2009 at 9:14 pm |
[...] search and infrastructure expert Doug Cutting is leaving the company to join Cloudera. He will be leaving Yahoo! at the end of August, 2009. Cutting created [...]
August 17, 2009 at 1:53 am |
[...] Joining Cloudera [...]
August 18, 2009 at 6:47 am |
hearty congratulations….
August 21, 2009 at 11:19 am |
Doug, congratulations. Nutch was great from the very beginning, and what you made after that was awesome. I cannot wait to see the rest.
September 14, 2009 at 10:46 pm |
Hey Doug, you are a great guy. I am now using your framework and learning it to build a search engine for Myanmar.
September 16, 2009 at 2:46 am |
Congratulations Doug. I was totally unaware of Lucene until a few days ago, when I found this great article, simple yet useful: http://www.ezdia.com/Lucene_in_five_minutes/Content.do?id=674
September 20, 2009 at 3:33 pm |
When I started to read this post, my first thought was that you were moving away from your creation.
Thank you for clarifying that your move will allow you to help your “child” not only to walk, but to run.
October 6, 2009 at 8:31 am |
Hi, Doug
I am a research analyst based in Dubai, United Arab Emirates. I am currently researching open-source search engines for Arabic. I would greatly appreciate it if you could provide me with your contact details through the e-mail address provided, since I am unable to get through on Cloudera’s telephone number.
Thanks and kindest regards.
October 12, 2009 at 8:42 am |
[...] recent additions to the Cloudera team is Doug Cutting, a search engine specialist from Yahoo and one of the founders of the Hadoop project. This is a big loss for Yahoo and a huge gain for [...]
October 15, 2009 at 8:56 pm |
Congratulations Doug – Good Luck
December 20, 2009 at 8:13 pm |
Congratulations and GOOD LUCK DUDE!!!
December 26, 2009 at 4:53 pm |
Cool article. I learned something new. Respect and kudos to the author :)
January 14, 2010 at 1:29 am |
Very entertaining thoughts, well told, with everything laid out clearly :)