Zhi-Zhuo Zhang

MOTTO:

 “Good judgments come from experience, but experience comes from bad judgments.”

Updated: Oct 1, 2007

Zhi-Zhuo Zhang

My Bachelor Thesis

Research INTEREST:

I have wide research interests, mainly including artificial intelligence, machine learning, data mining, information retrieval, pattern recognition, evolutionary computation, and neural computation, among which machine learning and data mining are my main research directions.

 

Education:

l  1997-2004 Study in Guangzhou No.2 middle School.

Ø  2002-2003 Represent Guangzhou City to attend the National Physics Competition and won the second prize (in 2003) and the third prize (in 2004) of Guangdong Province.

Ø  2003-2004 Be sent to the privilege class of Guangzhou City for the gifted in physics.

Ø  Attend the national training course for Mathematics Olympic Competition in Beijing.

l  BS: 2004-now 04Bilingual class of Computer Science in South China University of Technology (top 5% student| GPA: 85/100)

 

Publication:

1.      Zhi-Zhuo Zhang et.al, 2007, Ranking Potential Customers based on Group-Ensemble (PDF) (the special Spring 2008 issue of the International Journal of Data Warehousing and Mining, Accepted)

2.      Ying-Peng Zhang, Zhi-Zhuo Zhang, Qiong Chen, 2007, A New Nearest Neighbor Searching Algorithm based on M2M Model (PDF) (IMECS 2007 press, EI indexed )

3.      Ying-Peng Zhang, Zhi-Zhuo Zhang, Qiong Chen, 2007, A New Randomized Parallel Dynamic Convex Hull Algorithm based on M2M Model (PDF) (ACM, CTIC2007, Accepted)

 

Main Research Projects:

(Not including course projects)

1M2M Computation Model(Main contribution, 2007)

         This model present a hierarchical data structure which is very suitable for many points set operations. And now, M2M model has successfully been applied on convex hull and path finding problem as well. Most impressive thing of M2M model is its novel properties including preprocessing sharing, high parallel processing, easily trade off between different merits. This project also wins the outstanding award in the Challenge Cup of Guangdong province recently and will present in the national Challenge Cup in October 2007. The paper about nearest neighbor searching base M2M model has been published in IMECS2007 and indexed by EI.

 

2 Ranking Potential Customers Based on GroupEnsemble Method (Team leader, 2007)

       The paper presents a solution for PAKDD competition 2007, which is about a cross-selling in credit card and house load. Data imbalance, value missing and time-variant distribution, which are common in the real world data, also occur in this problem. We establish a 3-layers machine learning model including Bagging, RankBoost and Expending Regression Tree, to handle this problem. More detail about the problem and competition will be http://lamda.nju.edu.cn/conf/pakdd07/dmc07/ .

 

3 Netflix Movie Rating Prediction (Team leader, 2007)

       We present two prediction models for the two problem “who will rate the movie” and “how many person will rate the movie” corresponding to the two tasks in KDD CUP 2007.  The main difficulties of those problems are huge amount of data and very weak relation in the given data. SVD-Decomposition, Co-Cluster, Graph-Based method, Naïve bayes method and adjusted collaborative filtering can well solve the first difficulty but the second difficulty may have to rely on other prior knowledge to improve it. More detail about the problem and competition will be http://www.cs.uic.edu/~liub/Netflix-KDD-Cup-2007.html.

 

4.Hand-writing Chinese Character Recognition System (Project leader,2006)

This project wins the third prize of Universities’ Software Competition of Guangdong Province 2006. On-line recognition has capacity of 400 Chinese characters and off-line 20 characters now. The recognitions both satisfy the invariants of zooming, transforming and rotating. The main aim of this system is providing a platform to test image processing algorithm and recognizing algorithm.

 

5Distributed Firewall System Based on Intrusion Detection (Project leader, 2005)

       This project wins the second prize of Universities’ Software Competition of Guangdong Province 2005. The idea of this project comes from SOA and P2P service. Each agent cooperate to collect the information of the current status of network security and update the detect rule and detect components in various ways. The system combines misuse detection and anomaly detection, and supports novel concepts including dynamic action script, attack feature extraction, stateful inspection.

 

6GA-ANN-GA Control Model in Space Travel (Main contribution, 2006)

This project wins the best demonstrating award of Universities’ Software Competition of Guangdong Province 2006. This project applies the genetic algorithm to train the forward neural network in order to make it become familiar with the complex situation of space travel and control the spaceship automatically. Several novel techniques are invented by us in this model. We prove adding linear layers in ANN can boost the GA training and GA can be a good supervisor when it is expensive to find a human trainer. 

 

7. Text-based Protocol Detection and Learning (Project leader, 2006)

       Many internet applications design their own communication protocols, and most of those protocols are based on text. It is an interesting machine learning question that whether our machines can detect a communication channel and learn the protocol in the channel. This project tries to handle this task with some text classification methods, and evaluate the performance by two factors: accuracy and classification delay.

 

8.   Organ Procurement and Transplantation Network(OPTN), Survey and Suggestion (Team leader, 2007)

The OPTN, a savior in the eyes of many diseased, saves and prolongs numerous lives. Although the OPTN is functioning, there are still approximately 95,000 candidates waiting for an organ. The efficiency and effectiveness of the network are in dire need of improvement. Our project demonstrates several solutions to the critical bottleneck in current OPTN system.

 

Competition and Award

(International level)

1.      Honorable Mention of Interdisciplinary Contest in Modeling 2007(USA)

2.      PAKDD 2007 Competition (Open Category): the 15th place (47 success submits, 250 more registrations)

3.      KDD CUP 2007: Task1 the 27th place , Task2 the 20th place(39 success submits, 200 more registrations)

 

(National level)

1.EPSON foundation scholarship.(2005)

 

(Province level)

1.Outstanding Award of Challenge Cup of Guangdong(2007)

2.Second prize of Universities’ Software Competition of Guangdong Province (2005)

3.Third prize of Universities’ Software Competition of Guangdong Province (2006)

4.Best demonstrating award of Universities’ Software Competition of Guangdong Province (2006)

5.Third prize of Mathematic Modeling Competition in Guangdong Province.(2006)

 

(School level)

1.Third prize of ACM Programming Competition of SCUT (2006)

2.Second prize of Software Development Competition of SCUT (2005)

3.Third prize of Software Development Competition of SCUT (2006)

4.Third prize of Challenge Cup of SCUT (2007)

5.Excellent Student Monitor Award (2005)

 

Key Skill:

1. Familiar with C++, Java, C# programming language.

2. Familiar with open-source machine learning project such as Weka & Yale. 

3. Experience of working in AI field, especially algorithms of optimizing and machine learning.

4. Familiar with operation of image processing, and handwriting character processing and recognition.

5. Excellent in algorithm complexity analysis and optimization.

6. In-depth knowledge of Web-application Technology (PHP or ASP.net).

7. Familiar with TCP/IP and network security, with the experience of firewall system developing.

8. Skill in Mathematics modeling with practical problem.

9. Familiar with the technology of Information Retrieval and Search Engine.

 

Presentation

1. Application Based on M2M Model

Presented at IAENG International Conference on Artificial Intelligence and Applications (ICAIA'07), Hong Kong, March 23, 2007

Video Download

 

Working Experience:

1. A member of student union of SCUT

2. Technology manager of www.100steps.net

3. Participate in a NetMeeting project of a software company

4. Project manager in ExceedTech team

5. Volunteer in Robot Cup of China 2004

 

English Proficiency

CET 6: 527 (excellent level)

GRE: Verbal 480 + Quantity 800 + Analytical Writing 4

 

Major Courses’ Score:

Artificial Intelligence

99

Physics I

98

Modern Information Retrieval

97

Probability and Statistic

94

Operating System

93

Data Structure

92

Computer Graphics

91

Digital Logic

90

Physics II

90

Calculus I

89

Discrete Mathematics

88

Circuit and Electronic Technology

88

Object-Oriented Analysis and Design

88

Principle of Compilation

87

Algorithm Design

86

Mathematic Modeling

85

Embedded System Design and Develop

85

Programming Language Design and Methods

83

Linear Algebra

80

Distributed Computation

76

Software Project Management

76

Computer Networking

74

Software Engineering

75

Computer Organization and Structure

70

Calculus II

70