Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Curriculum Vitae

1,097 views

Published on

  • Be the first to comment

  • Be the first to like this

Curriculum Vitae

  1. 1. CURRICULUM VITAE Dr. Ioannis A. Sarafis Data Mining and Knowledge Discovery Specialist PhD in Computer Science MSc in Distributed & Multimedia Information Systems BSc in Applied Informatics Approved Postgraduate Tutor at Heriot-Watt University Address Vergas - Kastorias PC. 52052 Mobile number Kastoria, Greece +3 6948959683 (Day Phone number Phone) E-mail address +3 2467098345, +32467080659 ceeis@macs.hw.ac.uk, i.sarafis@gmail.com
  2. 2. PERSONAL INFORMATION Name: Ioannis Surname: Sarafis Father’s and Mother’s Full Name Alexandros Sarafis & Katina Sarafi Date of Birth: 2 August 1975 Place of Birth: Vergas – Kastorias Address: Vergas – Kastorias, P.C. 52052, KASTORIA Telephone: (2467)098345 (Home), 6948959683 (Mobile) E-mail: ceeis@macs.hw.ac.uk, i.sarafis@gmail.com, sarafisi@hotmail.com Web http://www.macs.hw.ac.uk/~ceeis Nationality: Greek Marital Status: Single Military Obligations: Completed RESEARCH INTERESTS My current research interests are focused on Data Mining Clustering of High Dimensional Databases, using stochastic techniques of search. In particular I’m working on: • Data Mining and Knowledge Discovery  Clustering Massive and High-Dimensional Real-World Databases  Classification  Prediction • Bio-inspired Algorithms (Evolurionary Algorithms, Genetic Algorithms) • Bioinformatics • Climate Change Modelling • Earthquake Prediction Analysis • Large-scale Parallel and Distibuted Computing • Computational Geometry • Advanced Data Structures (k-trees, R-trees, etc) • Sensor networks: monitoring techniques, tracking and prediction approaches in sensor networks FOREIGN LANGUAGES • English - Level: Excellent
  3. 3. EDUCATION Oct. 2000 – PhD in Computer Science Aug. 2005 Department of Computer Science, School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, United Kingdom Duration Enrolled in October 2000 - Graduated in November 2005. PhD Thesis • Title: “Data Mining Clustering of High Dimensional Databases with Evolutionary Algorithms”. • Supervisors: Dr. Phil W. Trinder (Senior Lecturer at Heriot-Watt University – Web: http://www.macs.hw.ac.uk/~trinder) and Dr. Ali M.S. Zalzala (CEO of FZ-LLC Teresol Web: http://www.zalzala.info/ and http://www.teresol.com/). • Examiners: Dr. Alex Alves Freitas (Senior Lecturer at the University of Kent – Web: http://www.cs.kent.ac.uk/people/staff/aaf/index.html) and Dr. Nicholas Kenelm Taylor (Senior Lecturer at Heriot-Watt University – Web: http://www.macs.hw.ac.uk/~nick). Abstract Driven by advances in data collection and storage, increasingly large and high dimensional datasets are being stored. Without special tools, human analysts can no longer make sense of such enormous volumes of data. Hence, intelligent data mining (DM) techniques are being developed to semi-automate the process of mining nuggets of hidden knowledge, and extract them in forms that can be readily utilised in areas such as decision support. Clustering high dimensional data is especially challenging due to the inherent sparsity of the dataspace. Evolutionary algorithms (EAs) are a promising technique for DM clustering as population-based searches have intrinsic search parallelism, their stochastic nature avoids local optima and recovers from poor initialisation. This thesis investigates the use of evolutionary algorithms to effectively and efficiently mine clusters from massive and high dimensional numerical databases. The fundamental question addressed by this thesis is: can a stochastic search cluster large high dimensional datasets, and extract knowledge that conforms to the important requirements for DM clustering? Experimental results on both artificial and real-world datasets lead us to conclude that it can. The thesis proposes a novel EA methodology for DM clustering with the following three phases. Firstly, a sophisticated quantisation algorithm (TSQ: Two-Stage Quantization) imposes a uniform multi-dimensional grid onto the dataspace to reduce the search combinations. TSQ quantises the dataspace using a novel statistical analysis that reflects the local data distribution. It determines an appropriate grid resolution that enables the discrimination of clusters, while preserving accuracy and acceptable computational cost. Secondly, a novel EA (NOCEA: Non-Overlapping Clustering with an Evolutionary Algorithms) discovers high quality clustering rules using several novel semi- stochastic genetic operators, an integer-valued encoding scheme, and a simple data coverage maximisation fitness function. Both TSQ and NOCEA rely on a novel statistical analysis (UDA: Uniform-region Discovery Algorithm) identifying flat density regions (U-regions) in univariate histograms. U-regions detected in orthogonal uni-dimensional projections are ``signatures'' of clusters being embedded in higher dimensional spaces. Thirdly, a post-processing simplification phase that removes irrelevant dimensions (subspace clustering) and assembles the clusters. The thesis also explores task parallelism for several genetic operations to improve scalability when the data to be mined is large and high dimensional. NOCEA is a generic and robust clustering algorithm that meets the key DM clustering criteria. The following properties of NOCEA are demonstrated on both benchmark artificial datasets, and in a substantial real-world case study clustering the seismic activity associated with the active crustal deformation along the African-Eurasian-Arabian tectonic plate boundary. NOCEA produces interpretable output in the form of disjoint and axis-aligned hyper-rectangular clustering rules with homogeneous data distribution; the output is minimised for ease of comprehension. NOCEA has the ability to discover homogeneous clusters of arbitrary density, geometry, and data coverage. NOCEA effectively treats high dimensional data, e.g. 200 dimensions, and it effectively identifies subspace clusters being embedded in arbitrary subsets of dimensions. NOCEA has near linear scalability with respect to the database size, and both data and cluster dimensionality. NOCEA has substantial potential for task parallelism, e.g. reaching a speed up of 13.8 on 16 processors. NOCEA produces similar quality results irrespective initialisation and order of input data. NOCEA is exceptionally resistant to background noise. Finally, NOCEA has minimal requirements for a priori knowledge, and does not presume any canonical distribution of the input data.
  4. 4. Sept. 2000 – Certificate of Attendance of the Teaching Training Program “Developing Postgraduates’ Oct. 2000 Teaching Skills” Heriot-Watt University, Edinburgh, United Kingdom Duration Enrolled in September 2000 - Graduated in October 2000. Main Subjects Completed a formal training program at Heriot-Watt University, Edinburgh, Scotland, United Kingdom. The aim of the program was to develop tutor skills that are required for teaching postgraduate/undergraduate students in the modern sciences. Sept. 1999 – MSc in Distributed and Multimedia Information Systems (DMIS) Sept. 2000 Department of Computing and Electrical Engineering, Heriot-Watt University, Edinburgh, United Kingdom Grade Average coursework and written exams performance 76% and 65%, respectively. Duration Enrolled in September 1999 - Graduated in November 2000. Main Subjects The aim of the full-time MSc course DMIS is to impart the skills, methodologies and techniques required for the design and realisation of advanced information systems. DMIS offers an one-year taught course consisting of a diverse set of modules enabling students to acquire a firm grounding in database systems, software development, distributed systems, multimedia and hypermedia, networking and interactive systems. The taught courses are: Information Systems Methodologies, Network Applications, Database Systems, Multimedia Design, Software Development (Java), Distributed System Programming, Advanced Database Systems, Multimedia Design. Dissertation (Grade 62%) • Title: “Performance Analysis of the Mouse Anatomy and Gene-Expression Database CORBA server”. • Supervisors: Dr. Albert Burger (Senior Lecturer at Heriot-Watt University – Web: http://www.macs.hw.ac.uk/~ab/) and Dr. Richard Baldock (Senior Scientist at Medical Research Council MRC – Web: http://genex.hgu.mrc.ac.uk/). • Examiners: Dr. Yiya Yang (Database design and implementation leader at MRC-Human Genetics Unit – Web: http://genex.hgu.mrc.ac.uk/. • Abstract: The Mouse Atlas Project at the MRC (Medical Research Council) Human Genetics Unit has reached the stage of testing a prototype of a gene-expression database. This database includes significant text data as well as image data, in the form of original images and mapped gene-expression patterns. To be able to optimise the system configuration and plan for future hardware and software enhancements, a careful performance analysis of the CORBA server and the databases is required. The purpose of this project is, therefore, in the first instance to develop an appropriate benchmark test and to measure the system’s performance. This should lead to the identification of bottlenecks and a set of recommendations for performance improvements. • Goals: The industrial dissertation project was a collaboration between the Human Genetics Unit of the Medical Research Council (MRC) in Edinburgh and the Department of Computing and Electrical Engineering, at Heriot-Watt University, Edinburgh. The MRC (Medical Research Council, Web:http://www.mrc.ac.uk/) Human Genetics Unit (http://genex.hgu.mrc.ac.uk/) in Edinburgh is developing a digital atlas of mouse development and database to be a resource for spatially mapped data such as in situ gene expression and cell lineage. The project is in collaboration with the Section of Biomedical Sciences (http://www.bms.ed.ac.uk/) within the Division of Biomedical and Clinical Sciences at the University of Edinburgh. This research programme is the Edinburgh Mouse Atlas Project (emap). The emap Atlas is a digital Atlas of mouse embryonic development. It creates a series of interactive three-dimensional computer models of mouse embryos at successive stages of development with defined anatomical domains linked to a stage-by- stage ontology of anatomical names. The Gene Expression Database is being developed as part of the Mouse Gene Expression Information Resource (http://genex.hgu.mrc.ac.uk/MouseGeneExpInfoRes/) in collaboration with the Jackson Laboratory (http://www.jax.org/), USA.
  5. 5. Sept. 1995 – BSc in Applied Informatics Jun. 1999 Department of Applied Informatics, University of Macedonia Economic & Social Sciences, Thessaloniki, Greece Grade First Class Honours 8.54 out of 10. Duration Enrolled in September 1995 - Graduated in June 1999. Main Subjects The programme of studies included courses from the disciplines of Computer Science, Economics and Business Administration: Data Structures, Architecture and Evaluation of Computers, Object- Oriented Programming Operating Systems, Data Bases, Data Communications and Computer Networks, Computer Graphics, Human-Computer Interaction, Multimedia Systems, Mathematics of Computer Sciences, Applied Mathematics, Computer Systems Organization, Statistics, Microeconomic Analysis, Computational Mathematics, Accounting, Macroeconomic Models and Policies, Net Programming, Money and Financial Markets, Marketing, Analysis and Design Of Information Systems, Econometrics, Information Systems Management. Dissertation (First-class honours 10 out of 10) • Title: “Distance Learning Education: An Introduction to Personal Computer Hardware”. • Supervisors: Dr. Konstantinos G. Margaritis (Professor at University of Macedonia Economic and Social Sciences – Web: http://eos.uom.gr/~kmarg/margarit.html). • Examiner: Dr. Manos Roumeliotis (Assistant Professor at University of Macedonia Economic and Social Sciences – Web: http://www.etl.uom.gr/mr/). • Goals: The aim of the project was to design and implement an educational website supporting distance learning for the hardware of personal computers. Mar. 2006 – Certificate of Completion of a Training Programme for Computerized Military Applications Apr. 2006 School of Research and Informatics Officers of the Hellenic Army (SAEP), Division of Research and Information Technology (DEPLH), Hellenic Army General Staff, Ministry of Defense, Athens, Greece Enrolled in March 2006 - Graduated in April 2006. Completed a formal training program imparting experience and expertise for operating Advanced Computerized Military Applications including information collection and distribution, Command and Control System, plan testing, battle field monitoring, training, personnel and material management WORK EXPERIENCE Feb. 2006 – Information Systems Designer/Developer, Network Administrator/Developer, Programmer Feb. 2007 Center of Information Technology Support of Greek Army (KEPYES), Division of Research and Information Technology (DEPLH), Hellenic Army General Staff, Ministry of Defense, Athens, Greece Designing and developing advanced Information Systems to support various administrative tasks of the Greek Army. Improving and administrating military computer networks providing the backbone infrastructure for secure and effective communication of the Greek Armed Forces. Oct. 2000 – Laboratory and Tutorial Teacher, Aug. 2005 Department of Computer Science, School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, United Kingdom Teaching (on average 10 hours per week in the winter trimester) undergraduate and postgraduate students on selected Multimedia Design/Implementation topics including: Adobe Photoshop, Macromedia Flash, Macromedia DreamWeaver, Macromedia Director, and Adobe Premier. Sept. 2002 – Supervisor in a First-Class Final Year Undergraduate Dissertation Project Sept. 2003 Department of Computer Science, School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, United Kingdom Acting as co-supervisor in a first-class final-year undergraduate project with title “Distributing Evolutionary Data Mining Algorithms”, Department of Computer Science, School of Mathematical and Computer Sciences, Heriot-Watt University. Sept. 1998 – Accounting in Medium Sized Jewellery Company Mar. 1999 Department of Applied Informatics, University of Macedonia Economic and Social Sciences, Thessaloniki, Greece Acting as trained accounting employee in a medium sized jewellery company owned by Mr. Christos Karipoglou, Agias Sofias 104, Thessaloniki, Greece. The practical training was an industrial collaboration between the Department of Applied Informatics, University of Macedonia
  6. 6. Economic and Social Sciences and the above named company. The project was solely funded by the European Union. PUBLICATIONS & PARTICIPATION in INTERNATIONAL CONFERENCES • I. A. Sarafis, Data Mining Clustering of High Dimensional Databases With Evolutionary Algorithms.NOCEA: Phd Thesis, Department of Computer Science, Heriot-Watt University, Edinburgh, Scotland, United Kingdom, Sept 2005. • I. A. Sarafis, P. W. Trinder and A.M.S Zalzala. NOCEA: A Rule-based Evolutionary Algorithm for Efficient and Effective Clustering on Massive High-dimensional Databases. International Journal of Applied Soft Computing (ASOC), Elsevier Science, 2006. (Invited Paper)(In press) • I. A. Sarafis, P. W. Trinder and A.M.S Zalzala. Towards Effective Subspace Clustering with an Evolutionary Algorithm. Proceedings of the IEEE Congress on Evolutionary Computation (CEC03), pp 797--806, Canberra, Australia, 2003. • I. A. Sarafis, P. W. Trinder and A.M.S Zalzala. Mining Comprehensive Clustering Rules With an Evolutionary Algorithm. Proceedings of Genetic and Evolutionary Computation Conference (GECCO03), pp 2301--2312, Chicago, USA, 2003. (Nominated for Best Paper Award) • I. A. Sarafis, A.M.S Zalzala and P. W. Trinder. A Genetic Rule-based Data Clustering Toolkit. Proceedings of the IEEE Congress on Evolutionary Computation (CEC02), pp 1238--1243, Honolulu, USA, 2002. • A.M.S. Zalzala, A. AlZain and I. A. Sarafis. A Data Mining Tool Using An Intelligent Processing System with a Clustering Application. Proceedings of the 5th International Conference of Adaptive Computing in Design and Manufacturing, (ACDM02), Devon, United Kingdom, 2002. • I. A. Sarafis. Attendance of the Scottish Higher Education Funding Council (SHEFC) Workshop on “Design Cluster For System Level Integration” established in the Universities of Edinburgh, Glasgow, Heriot-Watt, Strathclyde and the Institute for System Level Integration with support from SHEFC under the Research Development Programme, Edinburgh, Scotland, United Kingdom, Janurary 2002. • I. A. Sarafis. Efficient and Effective Data Mining Clustering on Massive-High Dimensional Datasets using Evolutionary Algorithms. Postgraduate Seminar, Department of Computer Science, Heriot-Watt University, Edinburgh, Scotland, United Kingdom, June 2004. • I. A. Sarafis. ”Performance Analysis of the Mouse Anatomy and Gene-Expression Database CORBA server”. MSc Dissertation, Department of Computing and Electrical Engineering, Heriot-Watt University, Edinburgh, Scotland, United Kingdom, Sept. 2000.
  7. 7. GRANTS & SCHOLARSHIPS • The Department of Mathematical and Computer Sciences (http://www.macs.hw.ac.uk), Heriot-Watt University, Edinburgh, Scotland, United Kingdom offered an International Travel Grant (£600) supporting the attendance of the Genetic and Evolutionary Computation Conference (GECCO-2003) (http://gal4.ge.uiuc.edu: 8080/GECCO-2003/) July 2003, Chicago, USA. • The American Association for Artificial Intelligence (AAAI) (http://www.aaai.org), Menlo Park, California CA 94025-3442, USA, offered a GECCO-2003 Student Travel Award supporting the participation in the Genetic and Evolutionary Computation Conference (GECCO-2003) (http://gal4.ge.uiuc.edu: 8080/GECCO-2003/) July 2003, Chicago, USA. • The Development and Alumni Office Fund Committee, Heriot-Watt University, Edinburgh, Scotland, United Kingdom (http://www.hw.ac.uk/wattclub/) offered an Alumni Fund Grant to support the presentation of the paper “Mining Comprehensive Clustering Rules with an Evolutionary Algorithm" (nominated for best paper award) in the prestigious Genetic and Evolutionary Computation Conference (GECCO-2003) (http://gal4.ge.uiuc.edu: 8080/GECCO-2003/) organized by the International Society for Genetic and Evolutionary Computation (ISGEC) (http://www.isgec.org/), July 2003, Chicago, USA. • The IEEE Neural Networks Society (http://www.ieee-nns.org/) offered a Student Travel Grant covering accommodation cost (five hotel nights in the Hilton Hawaiian Village Hotel) and registration fees for the International conference IEEE World Congress on Computational Intelligence and Congress of Evolutionary Computation (CEC-2002), (http://www.wcci2002.org) May 2002, Honolulu, Hawaii, USA. • The Royal Academy of Engineering, Westminster, London, United Kingdom (http://www.raeng.org.uk/) offered an International Travel Grant (Grant No. IJB/AH/ITG 02-148) (£600) supporting the attendance of the premier conference IEEE World Congress on Computational Intelligence and Congress of Evolutionary Computation (CEC-2002), (http://www.wcci2002.org) May 2002, Honolulu, Hawaii, USA. • The Heriot-Watt University offered (Ref. No. RVP/AF, 000303433) a three-year (2000-2003) Partially- Funded Postgraduate Research Scholarship covering tuition fees (£2,675 per year) and maintenance (£2,000 per year) for undertaking a PhD research in Computer Science, Department of Computer Science, School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, United Kingdom. • The Student Awards Agency for Scotland (SAAS) (http://www.student-support-saas.gov.uk/) sponsored (Grant No. SAAS P/11388781) the tuition fees (£2,675) for attending the one-year (1999-2000) full-time MSc in DIMS, Department of Computing and Electrical Engineering, Heriot-Watt University, Edinburgh, United Kingdom. SKILLS PROFILE Team Work and Communication According to both academic and industrial collaborators (reference letters are available upon request), my personality is classified as teamworker, self-motivated, reliable, hardworking, honest, sociable, and highly cooperative. I like to describe my working style as open, pragmatic, diplomatic and objective. Environments that I thrive in give a lot of freedom to each individual while keeping the team tight. I have presented my research at international conferences and worked on projects interfacing with specialists. The experience I have gained has taught me to be adjustable and to communicate efficiently with people of different backgrounds and origins. The six-years exposure to a demanding global research environment, the participation in prestigious international conferences presenting my novel papers (one of them was Nominated for Best Paper Award in the Proceedings of Genetic and Evolutionary Computation Conference (GECCO03), pp 2301--2312, Chicago, USA, 2003.), and finally, my superb written and spoken English, support strong communication and presentation skills. Technical Skills Professional and academic/research experience has given me the ability to use a range of implementation tools and to assimilate new technologies and ideas quickly. One of my key technical skills is devising intelligent data mining techniques for clustering massive and high-dimensional databases. The last six years I have been developing software tools (solely written in Java) with leading edge technologies. According to world-established academic/industrial referees my technical skills are summarized as: I’m well equipped to both teach and to undertake research. I’m a very capable researcher with the ability to synthesise research areas and to generate novel ideas coupled to the tenacity to solve hard real-world problems. I have developed superb technical writing skills and I’m developing an impressive publication record having authored several high quality journal and international conference papers, where many of them were nominated as best student papers. I have excellent technical skills with a broad Computer Science background, coupled with world- leading expertise in data mining and evolutionary computing research. I’m a very capable software/multimedia engineering, with expertise in several languages. These skills are an excellent basis for teaching a range of Computer Science subjects at different undergraduate and postgraduate levels. I have successfully completed a formal teaching training program leading to a certificate as an Approved Heriot-Watt University Tutor. I have an extensive teaching experience spanning for more than six years at Heriot-Watt University as both tutor and supervisor in final-year undergraduate dissertation projects.
  8. 8. Development Tools and Technologies • Operating systems: Windows, Linux, Unix, Dos, Solaris, Mac • Programming languages: Excellent knowledge of Java (Sun Microsystems) and all related technologies, JDBC-ODBC, Object Oriented Programming, Networking, RMI, CORBA, Distributed and Parallel Computing, Sockets, cgi, C++, C, Pascal, Basic, Delphi, Visual Basic, HTML, UML • Database tools: Oracle, Access, Object Oriented Databases e.g. ObjectStore • Other packages: Microsoft Word, PowerPoint, Vision, FrontPage, Access, Excel, Unix development tools e.g. latex, xfig • Multimedia packages: Adobe Photoshop, Macromedia Flash, Macromedia Dreamwaver, Macromedia Director, Adobe Premiere REFEREES PhD Supervisor: CEO of Technology and Research Solutions - Teresol: Dr. Phil W. Trinder Dr. Ali M.S. Zalzala Senior Lecturer CEO Technology and Research Solutions FZ-LLC (Teresol) School of Mathematical & Computer Sciences, P.O. Box 500735 Heriot-Watt University, Riccarton Campus, Dubai, UAE Edinburgh EH14 4AS, Scoltand, United Kingdom [e-mail] : admin@teresol.com [e-mail] : P.W.Trinder@hw.ac.uk [Telephone] : +971 (4) 362 5219 [Telephone] : +44 (0) 131 451 3435 [Fax] : +971 (4) 362 5329 [Fax] : +44 (0) 131 451 3327 [Personal Web] : http://www.zalzala.info [Personal Web] : http://www.macs.hw.ac.uk/~trinder [Company Web] : http://www.teresol.com PhD Examiner: Dr. Nicholas Kenelm Taylor Senior Lecturer and Deputy Head of Computer Sciences Department, School of Mathematical & Computer Sciences, Heriot-Watt University, Riccarton Campus, Edinburgh EH14 4AS, UK [e-mail] : N.K.Taylor@hw.ac.uk [Telephone] : +44 (0) 131 451 3436 [Fax] : +44 (0) 131 451 3327 [Personal Web] : http://www.macs.hw.ac.uk/~nick

×