I will be leaving Yahoo! at the end of this month to join Cloudera.
About five years ago I was working with Mike Cafarella on Apache Nutch, an open-source web-search engine. Initially we were able to crawl and index on four machines in parallel, but with a lot of manual steps. Inspired by two Google papers, we implemented a distributed filesystem and MapReduce implementation that automated most of these steps. Operation became much simpler, and we were then able to easily run Nutch on twenty machines, with near-linear scaling.
But to scale to the many billions of pages in the web we’d need to be able to run it on thousands of machines. And the more we worked on it the more I realized that would take a lot more developers and resources than we had to make this happen.
Yahoo! proposed to fill this gap. Eric Baldeschwieler led a team with talented folks, like Owen O’Malley, Sameer Paranjpye, and Nigel Daley. Eric said he’d dedicate his team to scaling this system to be able to process the full web. So, three and half years ago, I joined Yahoo! to help make this happen.
We exceeded my dreams. First we moved the distributed computing code out of Nutch into a new Apache project christened Hadoop. Then we set out to improve scalability, performance, and reliability, all the while adding many features. After one year Hadoop was used daily by many research groups within Yahoo!. After two years it generated Yahoo!’s web search index, achieving web-scale. Now, after three years, Hadoop holds the big-data sort record and the project has become a de-facto industry standard for big-data computing, used by scores of companies. The recent Hadoop Summit was attended by over 750 people from around the world.
Many folks at Yahoo! were instrumental in this story, including: Raymie Stata, Dhruba Borthakur, Arun C Murthy, Devaraj Das, Raghu Angadi, Hairong Kuang, Konstantin Shvachko, Runping Qi, Chris Douglas, Allen Wittenauer, Sharad Agarwal and Hemanth Yamijala, to name just a few. Yahoo! deserves enormous and ongoing thanks for the key role it plays in making Hadoop useful.
Now Hadoop is a thriving open-source project, with large and diverse developer and user communities. Going forward, Cloudera presents an opportunity to work with a wider range of Hadoop users. I hope to help synthesize these many voices into a project that best serves all.
Hadoop has grown to be a large, active, project very quickly, but it is still a young project. At Cloudera I will be well positioned to help it mature. This move will not fundamentally change my day-to-day activities. I will continue to work on Hadoop, working closely with developers from Yahoo! and elsewhere to build great software.
August 10, 2009 at 12:08 pm |
[...] — that Doug Cutting, co-founder of the Apache Hadoop project and creator of Nutch and Lucene, has agreed to join Cloudera beginning on September 1, 2009. Doug’s contributions to Hadoop over the [...]
August 10, 2009 at 12:40 pm |
Congratulations Doug! :)
August 10, 2009 at 1:07 pm |
[...] http://blog.lucene.com/2009/08/10/joining-cloudera/ [...]
August 10, 2009 at 2:29 pm |
Congratulations, and good luck!
August 10, 2009 at 5:50 pm |
[...] bring us to the last news item: Doug Cutting is leaving Yahoo for Cloudera, where he’ll continue to work on Hadoop. According to his blog post about it, [...]
August 10, 2009 at 6:27 pm |
[...] his blog post explaining the move, Cutting specifically states that he joined Yahoo in order to get the resources [...]
August 10, 2009 at 6:32 pm |
Good luck and keep up the good work!
August 10, 2009 at 6:59 pm |
[...] his blog post explaining the move, Cutting specifically states that he joined Yahoo in order to get the resources [...]
August 10, 2009 at 7:08 pm |
[...] his blog post explaining the move, Cutting specifically states that he joined Yahoo in order to get the resources [...]
August 10, 2009 at 7:08 pm |
[...] his blog post explaining the move, Cutting specifically states that he joined Yahoo in order to get the resources [...]
August 10, 2009 at 7:12 pm |
[...] his blog post explaining the move, Cutting specifically states that he joined Yahoo in order to get the resources [...]
August 10, 2009 at 7:18 pm |
[...] his blog post explaining the move, Cutting specifically states that he joined Yahoo in order to get the resources [...]
August 10, 2009 at 9:17 pm |
[...] his blog post explaining the move, Cutting specifically states that he joined Yahoo in order to get the resources [...]
August 10, 2009 at 9:55 pm |
Congratulations Doug… excelent decision :)
August 10, 2009 at 11:01 pm |
[...] his personal blog, Cutting says he’ll be doing much the same work with Cloudera that he was doing at Yahoo, and will continue [...]
August 11, 2009 at 4:10 am |
[...] his Hadoop project. (Picture from Facebook.) The timing was coincidental, he insisted. In a blog post he heaped fulsome praise on Yahoo and said that at Cloudera he will be "well-positioned to help it [...]
August 11, 2009 at 4:46 am |
Good Luck!!!!
August 11, 2009 at 5:49 am |
[...] Cutting, creator of open-source software framework Hadoop,has left Yahoo to join Cloudera, a Burlingame, Calif.-based startup that is commercializing Hadoop. The center of [...]
August 11, 2009 at 12:19 pm |
Just adding my congrats to the pile!
August 11, 2009 at 7:19 pm |
Congrats, Doug!
August 11, 2009 at 8:44 pm |
Congrats, Good luck.
August 11, 2009 at 11:04 pm |
[...] timing was coincidental, he insisted. In a blog post he heaped fulsome praise on Yahoo and said that at Cloudera he will be “well-positioned to [...]
August 12, 2009 at 11:53 am |
Good luck with Cloudera, glad to know you’ll still be involved in Hadoop.
August 12, 2009 at 2:22 pm |
[...] Doug Cutting is leaving Yahoo! and will be joining Cloudera. (Via Free Search) [...]
August 12, 2009 at 6:48 pm |
[...] Joining Cloudera « Free Search [...]
August 12, 2009 at 11:44 pm |
[...] – Doug Cutting deja Yahoo! para irse a Cloudera y seguir desarrollando [...]
August 13, 2009 at 12:02 am |
[...] Joining Cloudera [...]
August 13, 2009 at 11:03 pm |
[...] dimenticavo: ha lasciato [...]
August 14, 2009 at 9:14 pm |
[...] search and infrastructure expert Doug Cutting is leaving the company to join Cloudera. He will be leaving Yahoo! at the end of August, 2009. Cutting created [...]
August 17, 2009 at 1:53 am |
[...] Joining Cloudera [...]
August 17, 2009 at 1:16 pm |
[...] search and infrastructure expert Doug Cutting is leaving the company to join Cloudera. He will be leaving Yahoo! at the end of August, 2009. Cutting created [...]
August 18, 2009 at 6:47 am |
hearty congratulations….
August 21, 2009 at 11:19 am |
Doug, congratulation. Nutch was great since the very beginning and what you made after that was awesome. I cannot wait to see the rest.
September 14, 2009 at 10:46 pm |
hey Doug, you r a great guy. Now i use your frame work and learn it and make it for myanmar search engine
September 16, 2009 at 2:46 am |
Congratulations Doug, I was totally unaware of Lucene few days back untill I found this great article, simple though usefull, http://www.ezdia.com/Lucene_in_five_minutes/Content.do?id=674
September 20, 2009 at 3:33 pm |
When I started to read this post, my first thought was that you were moving away from your creation.
Thank you for clarifying that your move will allow you to help your “child” not only to walk, but to run.
October 6, 2009 at 8:31 am |
Hi, Doug
I am a research analyst based in Dubai, United Arab Emirates. I am currently researching about open source search engine in Arabic. Would greatly appreciate if you are able to provide me with your contact details through the e-mail address provided, since I am unable to get through Cloudera’s telephone number.
Thanks and kindest regards.
October 12, 2009 at 8:42 am |
[...] recent additions to the Cloudera team is Doug Cutting, a search engine specialist from Yahoo and one of the founders of the Hadoop project. This is a big loss for Yahoo and a huge gain for [...]
October 15, 2009 at 8:56 pm |
Congratulations Doug – Good Luck
March 4, 2010 at 3:51 am |
The fact that infertility is on the increase among couples in both developed and developing countries of the world is definitely not contentious. However, the role that the male factor plays in infertility has consistently been debated. In the past, it was considered unthinkable to suggest that the male was the sick party when conceiving becomes a problem. Although things have changed a lot, people now understand that male infertility too is a factor to be considered when a couple suffers infertility, there is still not so much information and/or knowledge about male infertility, as there is about female infertility; at least among the general population.
March 5, 2010 at 8:30 am |
Exactly, what i was looking for. Thank you.