ClassCloud: switch your PC Classroom into Cloud Testbed

Speaker(s) : Jazz Wang
Language : English Level : Confirmed Nature : Conference
Date : Wednesday 7 July 2010 Schedule : 14h00 Duration : 40 minutes
Place: LaBRI - Amphi 050
Transversal topics: In English


Cloud computing has been a growing research topic in recent years. Its key concept is to provide a resource-sharing model based on virtualization, distributed file systems, parallel algorithms and web services. But how can we provide a testbed for cloud computing training courses? In this talk we will share our experience of building a cloud computing testbed for virtualization, high-throughput computing and bioinformatics applications. The talk covers many open source projects, such as DRBL, Xen, Hadoop and bioinformatics-related applications.

In short, Diskless Remote Boot in Linux (DRBL) provides a diskless or systemless environment for client machines. It works on Debian, Ubuntu, Mandriva, Red Hat, Fedora, CentOS and SuSE. DRBL uses distributed hardware resources and makes it possible for clients to fully access local hardware.

Xen is an open source hypervisor for the Linux kernel. It has been used in the Amazon EC2 production environment to provide cloud service model (1): "Infrastructure as a Service (IaaS)". In this talk, we will show you how DRBL can help with the fast deployment of a Xen playground in the classroom.

Hadoop is becoming a well-known open source cloud computing technology, developed by the Apache community. It is a very powerful tool for data mining. It has been used in the Yahoo and Facebook production environments to provide cloud service model (2): "Platform as a Service (PaaS)". It is easy to set up a single Hadoop node but difficult to manage a Hadoop cluster. In this talk, we will show you how DRBL can help with fast deployment and management.

Most bioinformatics applications are open source, such as R, Bioconductor, BLAST, Clustal, PipMaker, Phylip, etc. But they also require traditional cluster job submission. In this talk we will show you how DRBL can help to build a testbed for bioinformatics research and provide cloud service model (3): "Software as a Service (SaaS)". In this talk, we will cover how to:

- 1. Use DRBL to deploy a Xen virtual cluster (drbl-xen)
- 2. Use DRBL to deploy a Hadoop cluster (drbl-hadoop)
- 3. Use DRBL to deploy a bioinformatics cluster (drbl-biocluster)

The talk will also include a live demonstration of drbl-hadoop and drbl-biocluster.


Jazz Yao-Tsung Wang is a co-developer on the DRBL/Clonezilla team at the Free Software Lab, NCHC, NARL, Taiwan.

The Free Software Lab at NCHC mainly develops open source software for education, including DRBL, Clonezilla, partclone, Tux2Live, etc. The DRBL/Clonezilla team was one of the winners of the Trophées du Libre competition in the category "Public sector applications".

The National Center for High-performance Computing (NCHC) is the only supercomputing center in Taiwan. It is a non-profit organization founded by Taiwan's National Science Council (NSC). Its mission is to assist researchers on a national level.

Jazz Wang's speaking experience includes:

- [1] "Building an Cloud Computing Analysis System for Intrusion Detection System", Cloud Slam’09, 2009-04-22
- [2] "BoF: Clonezilla Hands-On Lab", LinuxWorld Conference & Expo 2008, 2008-08-06
- [3] "Massive Deployment of Kerrighed Virtual SMP Cluster using DRBL", Open Source Grid and Cluster Conference 2008, 2008-05-16