Study plan

Course Schedule

The classes start at 9am until 5pm with lunch and coffee breaks
The student will chose 2 classes per day

Course Schedule

Courses

General Courses

R for Data Science

 

In this course you will learn how to program in R and how to use R for effective data analysis with a  particular focus on using R for software development in a data science setting. You will learn how to leverage  packages necessary for a statistical programming and understand generic programming language concepts to use in a high-level statistical language. The course covers basic R language fundamentals, key concepts, and practical issues in statistical computing which includes programming in R, reading data into R, accessing R packages, writing R functions, debugging, profiling R code, and organizing and commenting R code. Course will provide hands on working examples.

Tools - R/RStudio

 Necessary skills to participate in the course - Analytical thinking, desire to learn

Instructor

Dr. Margarit Khachatryan

US, Founder and CEO of MagAnalytics

Python for Data Science

This course aims to provide the basics of programming in Python with a special focus on data analysis. The participants will learn how to leverage Pandas for reading, manipulating and summarizing data, how to visualize results using Matplotlib and Seaborn. Numpy and Scipy  packages will be introduced for scientific computing in Python.

Instructor

Dr. Hrant Davtyan

Armenia, Assistant Professor of Data Analytics, AUA

Intro to Inferential Statistics

Inferential statistics allows to draw conclusions from data that might not be obvious. This course focuses on enhancing participants’ ability to develop statistical hypotheses and use both parametric and non-parametric hypothesis testing tools like t-tests, Chi-square and other tests to validate proposed claims.

Tools - R

Necessary skills to participate in the course - Basic Statistics (measures of central tendency and variation). Basic programming skills in R

Instructor

Vahe Movsisyan

Armenia, Central Bank of Armenia,  AUA, Co-founder of Datamotus LLC

Intro to Machine Learning

Elective Courses

Introduction to Urban Data Science

In this course, attendees will be provided with an overview perspective on location-based technologies with a focus on urban applications. From human mobility models and
theory to neighborhood detection and finding an appropriate location for placing a shop or amenity attendees will be introduced to the mechanics and building blocks of a number of research works with a strong application orientation. Historically important theoretical concepts in the area will be covered that include Central Place Theory and Reilly’s law of retail gravitation. Next, the spotlight will be put on newer advancements in the area of the so-call gig economy, introducing services like Uber and Airbnb, highlighting opportunities for research using corresponding transport and hospitality datasets.

Tools - Python, Mapbox

Instructor 

Dr. Anastasios Noulas

USA, Assistant Professor and Faculty Fellow, Center of Data Science, New York University

Natural language Processing (NLP)

The intent of the course is to present an introduction to Natural Language Processing, the study of computing systems that can process, understand, and analyze human language. The primary focus of the course will be on understanding various NLP tasks, such as information extraction and information retrieval, discussing algorithms for effectively solving these problems, as well as methods for evaluating their performance.

Tools - Python, Perl

Necessary skills to participate in the course - Basic programming skills, as well as reasonable knowledge of mathematics and statistics.

Instructor 

Dr. Lana Yeganova

USA, Scientist at NIH

Introduction to Computer Vision and Image Processing

This course will introduce the students to the world of Computer Vision, starting with a brief history of the field all the way to the recent advances of image processing. We will be using the famous open source library OpenCV to load and manipulate images.

Tools - OpenCV

Necessary skills to participate in the course - Basic programming skills (preferably in Python)

Instructor 

Mr. Christophe Zoghbi

Lebanon, Computer Vision & Machine Learning Consultant

Deep Learning
Time Series Analysis

The course will provide a comprehensive introduction to time series  visualization, modeling and forecasting. It will teach the R language functions for time series analysis. Main topics include data decompositions, classical forecasting models such as exponential smoothing and ARIMA, hypothesis testing for trend and seasonality detection, algorithms for period detection. The main point of discussions should be clarifying the difference between deterministic and stochastic characteristics of time series.

Tools - Powerpoint, Rstudio

Necessary skills to participate in the course - R language, calculus, statistics, probability

Instructor 

Dr. Arnak Poghosyan

Armenia, VMware

Deep Learning in Computer Vision

Deep Learning adds a huge boost to the field of Computer Vision. In this course, we will introduce a special type of Neural Networks called Convolutional Neural Network, that are used to extract features from images. We will used these networks to build image classification and object detection models.

 

Tools - Jupyter notebooks (or Google Colab), Tensorflow/Keras

Necessary skills to participate in the course - Basic knowledge of how Neural Networks work, basic programming background

Instructor 

Mr. Christophe Zoghbi

Lebanon, Computer Vision & Machine Learning Consultant

Theoretical and Applied Bioinformatics

An enormously large series of complex and chaotic events have shaped the genomes of eukaryotes, prokaryotes and viruses. This course will address cutting edge approaches for the computational investigation of these events, with an eye toward developments in translational systems biology. The course will present the fundamentals of genome analysis and data management focusing on data search and retrieval, similarity searches, genome assembly and annotation, metagenomics and comparative genomics.

 

Tools - NCBI online resources (Entrez, BLAST, SRA), Python

Necessary skills to participate in the course - Basic concepts of Molecular Biology

Instructor Profile

Dr. Tatiana Tatusova

USA, Senior Scientist, NCBI/NLM/NIH

Recommendation Systems

The course will cover the general ML approaches used for recommendation systems (memory based and model based, explicit and implicit user feedbacks, etc). It will cover different algorithms, from very basic to more advanced - Matrix Factorization, Factorization machines, Neural Networks based, Hybrid recommenders, etc.

Industry specific/domain specific

Introduction to Survival Analysis with Use Cases

During the lecture I'm more than excited to walk participants through basics and definitions of survival analysis. We will walk step by step by the most crucial parts of the data analysis procedure in the scenario of time-dependent data and areas of censored information. There will be a place to learn diagnostic plots used in the formulation and verification of cox proportional hazards model, which is the most know procedure applied to churn analytics. If there is a time, we can consider model for multi-state nature of the survival analysis.

 

Tools - R, RStudio

Necessary skills to participate in the course - Basic idea of R usage and basic knowledge about linear models (like linear regression)

Instructor 

Mr. Marcin Kosinski

Poland, Statistician at Gradient Metrics LLC

Web Scraping in R

During the two days, the students should be able to write a web scraping code in R (using packages (I) RSelenium, (II) httr and XML or (III) rvest).

The collected data from web would be processed using dplyr package. The students will learn how to use ggmap package to enrich their data with geographical information and create cool maps. Besides, we will use ggplot2 and the basic graphics for the graphical exploratory data analysis.

During the course, I will use the following websites for scraping:
1. www.immoscout24.de – P1 GOAL: Figure out the rent prices in German counties. Find the variables, which could explain the higher price level on the flat rent market (For example, we could investigate the rents in Berlin area).
2. www.betexplorer.com – P2 GOAL: Figure out if there is a possibility for risk-less profit by allocation of the bet between different bookies for the same game.

After accomplishing this course, students should be able to write their very own R scraper to collect data and answer the research question they are interested in. Further projects with an ongoing remote supervision is possible (e.g. www.sas.am, www. parliament.am, www.rate.am, www.list.am etc).

Tools - Slides/tasks. We are going to code a lot together!

Necessary skills to participate in the course  - Solid Background in R (dplyr, stringr), HTML and Fundamentals of Econometrics

Instructor 

Dr. Andranik Stepanyan

Germany, Senior Data Analytics Consultant

 

Data Science for Health Care

Large amounts of heterogeneous data are becoming available in medicine and healthcare, from electronic health records to genomic data, and data collected by various wearable devices. The scale, diversity and complexity of these datasets present serious challenge in analyses and subsequent practical applications. In this course, you will learn about successful case studies with state-of-the-art applications of big data and artificial intelligence in healthcare. You will get familiar with some of the different types of data and machine learning methods involved in data science for healthcare and will have a hands-on experience of working with such data.

 

Tools - Jupyter Notebook, Python, scikit-learn

Necessary skills to participate in the course  - Python

Instructor 

Dr. Harutyun Shahumyan

Ireland, Visiting Research Fellow at University College Dublin and Senior Data Scientist at United Health Group

Building your Data Science Platform with AWS

In this course you learn how to build up your data science platform with AWS. You will learn how to control costs and which services help you to prepare your data for later application of ML application.

 

Tools - AWS, PyCharm, VIM,

Necessary skills to participate in the course  - Linux, Python and SQL basics, a general understanding of hardware and networks

Instructor 

Mr. Stephen Papp

Austria, Lecturer and Entrepreneur

Text Classification with Fasttext

The first part of the course aims to familiarize the participants with Facebook's FastText algorithm. In the second part we will implement the algorithm in tensorflow.

 

Tools - Laptop with Python and Tensorflow pre-installed

Necessary skills - Python, Tensorflow, familiarity with text classification will help

Instructors

Dr. Margarit Khachatryan

Dr. Margarit Khachatryan
US, Founder and CEO of MagAnalytics

Fields of Interest: Analytics - Operations Research, Data Science

 

Dr. Maga Khachatryan is a Founder & CEO of MagAnalytics, a Data Science and Operations Research consulting company which helps businesses to make better decisions through the use of predictive and prescriptive analytics. Throughout her career Dr. Khachatryan led a number of analytics projects for Fortune 500 Companies and startups. She was instrumental in developing and successfully delivering complex mathematical models that ultimately optimized business performance and generated value. Dr. Khachatryan received her PhD in Industrial Engineering from Georgia Institute of Technology and her Master degree from the American University of Armenia.
​​Dr. Khachatryan is actively involved in serving Data Science and Operations Research community.  She is elected President for Women in Operations Research and Management Science (WORMS) at the Institute for Operations Research and the Management Sciences (INFORMS), initiated an award to support WORMS,  she also chairs and organizes scientific sessions and practitioners colloquiums at the INFORMS Meetings, mentors young professionals and students who are interested in STEM. She is also actively engaged in Analytics Society of INFORMS. Partnering with  Stanford University Dr. Khachatryan became an Ambassador of Women in Data Science in St. Louis, MO, USA (WiDS St. Louis) to inspire, educate and support women in the field and provide every woman with career and growth opportunities.

US, Founder and CEO of MagAnalytics

Dr. Hrant Davtyan

Dr. Hrant Davtyan
Armenia, Assistant Professor of Data Analytics, AUA

Fields of Interest - Deep Learning, NLP, Reinforcement learning, Remote Sensing

 

Dr. Davtyan is an Assistant Professor of Data Science and Business Analytics at AUA. He has developed and is currently teaching courses on Data Scraping, Business Analytics and Business Intelligence. His research interests range from deep learning and NLP to reinforcement learning to remote sensing. Dr. Davtyan has an extensive experience of providing consultancy services to private companies ranging from IT to finance and healthcare sector as well as the public sector as a contracted expert for UN agencies. He is the founder of Metric data science consultancy and machine learning research company providing wide range of analytical and product development services to local and international partners.

Armenia, Assistant Professor of Data Analytics, AUA

Dr. Anastasios Noulas

Dr. Anastasios Noulas
USA, Assistant Professor and Faculty Fellow, Center of Data Science, New York University
Fields of Interest - Human Mobility, Location Intelligence,  Urban Systems Anastasios is a Moore-Sloan Data Science Fellow at the Center for Data Science, New York University.  Anastasios completed his PhD in 2013 in the Network and Operating Systems group at the Computer Laboratory in the University of Cambridge.  His main research interests include human mobility modelling, social network analysis, location-based services and disruptive technologies. During his PhD he has published at top-tier conferences and journals in the fields of data mining, online social systems and user behaviour modelling, combining machine learning and complex systems techniques. Prior to joining New York University, Anastasios was also a Data Scientist at Foursquare Labs in New York and Telefonica Research, Madrid. Over the past years he has collaborated with a number of Startups including retail analytics provider PiinPoint, whereas in 2015 he has teamed up with researchers in Cambridge and Belgium to launch the OpenStreetCab project.

USA, Assistant Professor and Faculty Fellow, Center of Data Science, New York University

Dr. Lana Yeganova

Dr. Lana Yeganova
USA, Scientist at NIH
Fields of Interest - Machine Learning, Data Science

USA, Scientist at NIH

Mr. Christophe Zoghbi

Mr. Christophe Zoghbi
Lebanon, Computer Vision & Machine Learning Consultant

Fields of Interest - Deep Learning, Computer Vision, Reinforcement Learning, Astrophysics

 

Christophe is a Software Engineer and Technical Manager with over 8 years of experience in development and various fields of Data Science and Artificial Intelligence. After working with mobile application development for 5 years, he transitioned to the field of Machine Learning and worked as a consultant on many projects, involving Computer Vision and Natural Language Processing problems.
He is also the founder of Beirut AI, the applied Artificial Intelligence community in Lebanon, where he organizes community events and technical workshops to help people understand and apply Artificial Intelligence. He was also the organizer of Papers We Love, Lebanon chapter, a community for reading and discussing academic computer science papers.

Lebanon, Computer Vision & Machine Learning Consultant

Dr. Arnak Poghosyan

Dr. Arnak Poghosyan
Armenia, VMware

Fields of Interest - Approximations and  Interpolations, Fourier Analysis, Convergence of series, Time series analysis, machine learning, neural networks

 

Arnak Works at VMware since 2010. He also delivers lectures at AUA and YSU. Arnak is a specialist  in the area of time series analysis, machine learning, anomaly detection. He is an author of 40 US patents and more than 40 scientific papers.

Armenia, VMware

Mr. Stephen Papp

Mr. Stephen Papp
Austria, Lecturer and Entrepreneur

Fields of Interest - Big Data, Startups, Cloud Computing, Python, AWS

 

Lecturer on Data Engineering and Cloud Computing

Austria, Lecturer and Entrepreneur

Dr. Tatiana Tatusova

Dr. Tatiana Tatusova
USA, Senior Scientist, NCBI/NLM/NIH

Fields of Interest - Bioinformatics

 

Tatiana A. Tatusova, Ph.D is a Senior Scientist at the National Center for Biotechnology Information (NCBI), National Institutes of Health.
She is leading a group of software developers and biologists working on Genomic sequence analysis, data management and curation, and visualization tools. Dr. Tatusova received her bachelor and Master of Science degrees from Department of Physics at Moscow State University and her Ph.D. in Biophysics from Department of Biology at Moscow State University. Her research interests focus on computer-assisted comparative genomics, developing software tools for analyzing and representing the bio-molecular sequence data and developing interactive WWW sites.

USA, Senior Scientist, NCBI/NLM/NIH

Mr. Marcin Kosinski

Mr. Marcin Kosinski
Poland, Statistician at Gradient Metrics LLC

Fields of Interest - Churn Analysis

 

Marcin Kosiński has a master degree in Mathematical Statistics and Data Analysis specialty. Challenges seeker and devoted R language enthusiast. In the past, keen on the field of large-scale online learning and various approaches to personalized news article recommendation. Community events host: co-organizer of the +1600 members R Enthusiasts meetups in Warsaw and the main organizer of the Polish R Users Conference 2017 and 2018 (‘Why R? 2017’ whyr.pl). Interested in R packages development and survival analysis models. He worked as a subject matter expert at +3000 members Data Crunchers Online R Course at The Warsaw School of Data Analysis. Currently explores and improves methods for quantitative marketing analyses and global surveys at Gradient Metrics.

Poland, Statistician at Gradient Metrics LLC

Dr. Andranik Stepanyan

Dr. Andranik Stepanyan
Germany, Senior Data Analytics Consultant

Fields of Interest - Spatial Analysis, Hedonic Regressions, Web Scraping, Data Manipulation

 

I am passionate about collection, cleaning, processing and mining of large amounts of data using R and Python.
Currently, I am working at Deloitte Germany as a Senior Data Analytics Consultant in projects linked to Mergers & Acquisitions, Carve-outs and Reorganization of business entities. Before joining Deloitte, I was a postdoc researcher at the University of Trier, where I was involved in research projects on the disentangling of agglomeration forces and measurement of spatial concentration in Germany. Besides, I have consulted local governments and institutions in matters concerning sustainable regional development and labor market research as a freelance consultant.

Germany, Senior Data Analytics Consultant

Dr. Harutyun Shahumyan

Dr. Harutyun Shahumyan
Ireland, Visiting Research Fellow at University College Dublin and Senior Data Scientist at United Health Group

Fields of Interest - Artificial Intelligence, Data Science, Geographic Information Science

 

Harutyun Shahumyan is a Marie Curie Alumni and currently works as a Senior Data Scientist at United Health Group in Ireland. Harutyun holds an Honours Master’s Degree in Applied Mathematics and a Ph.D. in Engineering. His expertise includes machine learning, deep learning, statistics, modelling and GIS. Harutyun has over 12 years academic experience working at University College Dublin, University of Maryland, University of Cambridge and National Academy of Sciences of Armenia. He has over 50 publications focused on geospatial modelling, model integration, predictive analytics and application of decision support systems.

Ireland, Visiting Research Fellow at University College Dublin and Senior Data Scientist at United Health Group

Dr. Arsen Yeghiazaryan

Dr. Arsen Yeghiazaryan
Webb Fontaine, Machine Learning Specialist

Fields of interest - Deep Learning, Recommendation Systems, Clustering, Tree-based algorithms

 

Experienced Scientific Researcher with a demonstrated history of working in the fields of Machine Learning and Theoretical Physics. Skilled in Math, Statistics, Machine Learning theory, Quantum Field Theory, Python, Programming. Strong research professional with a Doctor of Philosophy (Ph.D.) focused in Theoretical and Mathematical Physics.

Webb Fontaine, Machine Learning Specialist

Mr. Sipan Muradyan

Mr. Sipan Muradyan
Machine Learning Specialist at Webb Fontaine, Armenia

 

Fields of interest - Computer Vision, NLP

AUA alumni, working in the field of ML for 3 years. My ML career started in Picsart. I worked for 2 years on image related tasks (classification, similarity, detecting duplication). At Webb Fontaine, we are currently working on 2 projects, which are fraud detection and text classification.

Machine Learning Specialist at Webb Fontaine, Armenia

Dr. Vahe Movsisyan

Vahe Movsisyan
Armenia, Central Bank of Armenia, AUA, Co-founder of Datamotus LLC

Fields of Interest - Data Mining and Statistical Analysis

 

I teach Applied Statistics (since 2013) and Data Mining (2019) courses at AUA. I received a diploma in Economics from Armenian State Institute of Economics in 2004 and was awarded a PhD. degree in Economics by Moscow State University of Economics, Statistics and Informatics (MESI) in 2007.
I started my professional career as a financial market analyst at the Monetary Police Department (2005-2006) of the Central Bank of Armenia, after moved to Statistics Department (2007-2015). In 2015 I’ve joined Economic Research Department. I’ve participated and presented methodological and applied research papers in numerous national and international conferences. I’m a member of International Statistical Institute (ISI) and International Association of Survey Statisticians (IASS).
I’m a co-founder of two Datamotus LLC (Armenia) and Mychoice LLC (Singapore).  Datamotus is a company that provides Data mining and statistical analysis services; Mychoice is an online market research tool that automates entire process of conjoint studies.

Armenia, Central Bank of Armenia,  AUA, Co-founder of Datamotus LLC