Module: Praktijkopdracht Thema 9: Introduction to Machine Learning

Gegevensveld Waarde
Osiriscode BFVH3TH9
ECTS 6
Toetsvorm Opdracht
Minimum cijfer 5,5
Docent(en) NOMI
Contactpersoon NOMI
Voertaal Nederlands

Cursusdoelen (leerdoelen)

After successfully completing the course, students will be able to

  • Perform an exploratory data analysis (EDA) on a dataset that serves as input data for machine learning;
    • Identify outliers and treat these correctly
    • Identify correlated attributes
    • Apply appropriate data transformations
    • Create visualizations that help understand the data and assess its quality
  • Carry out Machine Learning research:
    • Intelligently select, apply, optimize and evaluate ML algorithms;
    • Carry out a quality control on the aspects of code and algorithm performance using appropriate measures of quality;
    • Design and perform benchmarks to compare ML applications;
  • Publishing a ML algorithm to the scientific community by wrapping it into a Command-line Java application and publishing it on a well-documented git repository;
  • Communicate research results by means of work discussions, presentations, and a scientific report – focusing primarily on the sections Results and Discussion;
  • Translate the end result of her own project into a proposal for a subsequent research project, i.e. in one of the bioinformatics minors;
  • Display a well-founded critical attitude towards own and others' results as well as own professional functioning

Inhoud

In this quarter project you will get to know Data Mining (DM), also called Machine Learning (ML). These two terms are not quite the same but are often used as if they are. By the end of the course, you will know what the difference is.

You will work individually on a ML problem of your own choice, but one or more other students will analyze the same dataset. This will make troubleshooting easier and opens the possibility of peer review (analysis and code), which is also an important skill to master: providing help and critical feedback to colleagues. The research projects will either come from the public domain, or from the Hanze or one of its partners.

General organization Although the central theme is Machine Learning, several other topics and/or competences will be trained as well – in particular research skills. Therefore, the learning outcomes for the course are the following.

Assessment criteria

  1. Day-to-day professional functioning (25%)
    • Critical attitude towards own work and that of others
    • Peer review (giving and receiving!)
  2. Research log (Markdown) containing description of daily work - primarily dealing with the research phase (25%)
    • Reproducibility
    • Readability
  3. Wrapper application of final algorithm (25%)
    • Functionality
    • Code quality
    • Documentation + Readme.md
    • Git usage
  4. Report, focusing on the sections Results and Discussion and also assessing reporting technique (25%) The report should discuss
    • data exploration, data cleaning and transformations
    • Strategy and methodology of ML algorithm benchmarking and selection
    • Strategy and methodology of ML algorithm performance testing and optimization focusing on aspects of speed, memory usage, performance (accuracy, sensitivity etc)
    • Evaluation of final ML algorithm
    • Suggestions for future work

This Excel sheet shows you exactly how you will be graded. It has the form of a Rubrics grading system.

Literatuur en andere bronnen

Literatuur

Web

Competenties

1(III), 2(III), 3(III), 4(III), 5(III), 7(II), 9(II), 10(I)

Werkvormen

  • Praktijkopdrachten/Projectonderwijs

Ingangseisen

  • Het wordt sterk aangeraden ook de cursussen Introduction Datamining en Introduction to Java programming te volgen omdat deze cursussen essentiele kennis aanbieden en vaardigheden trainen voor het uitvoeren van de projectopdracht.

Ingangseisen toets

-

Voorkennis

  • Er wordt uitgegaan van een goede basis in statistiek (Statistiek 1+2 & 3) en goede beheersing van R (Informatica 4).

Voorkennis kan worden opgedaan met

-

Bronnen van zelfstudie

-

Verplicht materiaal

-

Aanbevolen materiaal

-

results matching ""

    No results matching ""