辅导案例-COMP219

欢迎使用51辅导,51作业君孵化低价透明的学长辅导平台,服务保持优质,平均费用压低50%以上! 51fudao.top
COMP219 - 2019 - First CA Assignment
Individual coursework
Simple Machine Learning Model
Assessment Information
Assignment Number 1 (of 2)
Weighting 10%
Assignment Circulated Thursday 10 October 2019
Deadline Friday November 8 2019, 15:00
Submission Mode Electronic
Learning outcome assessed 2. Ability to choose, compare, and apply suitable basic
learning algorithms to simple applications;
Purpose of assessment To implement machine learning algorithms on a dataset
Marking criteria The marking scheme can be found in Section 3
Submission necessary in order No
to satisfy Module requirements?
Late Submission Penalty Standard UoL Policy.
1
1 Objectives
This assignment requires you to implement and evaluate one (or multiple) simple machine
learning models on two datasets.
2 Requirement and Description
Language and Platform Python (version 3.5 or above). You can use some other libraries
available on Python platform, including numpy, scipy, scikit-learn, and matplotlib. If you
intend to use libraries other than these, please consult the demonstrator or the lecturer.
Dataset Please use one of the following two datasets, whose information can be found in
https://scikit-learn.org/stable/datasets/index.html:
• Optical recognition of handwritten digits dataset
• RCV1 dataset
Learning Task You can choose either classification (preferred) or regression.
Learning Model/Algorithm You may choose at least one learning algorithm from the
following list:
• decision tree learning
• naive Bayes
• k nearest neighbor
Assignment Tasks Once you have selected a learning algorithm, you need to implement
the following functionalities :
f1 provide the details of the dataset,
f2 train a machine learning model by calling an algorithm from the machine learning
libraries such as scikit-learn, and save the model so that it can be called later. You can
save a model with scikit-learn built-in functionality such as https://scikit-learn.
org/stable/modules/model_persistence.html.
f3 implement a machine learning algorithm by yourself, train a model with the algorithm,
and save the model so that it can be called later;
f4 compare the train error and test error of the two models;
f5 enable the user to query the saved models with e.g., an index of the test dataset.
2
Additional Requirements We have additional requirements that,
1. the marker can run your code directly, i.e., see the results of functionalities f1, f4, and
f5 by loading the saved models, without calling the training functionalities f2 and f3,
and
2. you need to provide clear instructions on how to train the two models, i.e., run func-
tionalities f2 and f3. The instructions may be e.g., a different command or an easy
way of adapting the source code.
Documentation You need to write a proper document
1. detailing how to run your program, including the software dependencies,
2. explaining how the functionalities and additional requirements are implemented, and
3. providing the details of your implementation, including e.g., the meaning of parameters
and variables, the idea of your algorithm, etc.
Also, the document needs to follow the guidelines in Note 1 of Section 3.
Submission files Your submission should include the following files:
• a file for source code,
• two files for saved models, and
• a document.
Please see Section 4 for instructions on how to package your submission files.
3 Marking Criteria
The assignment is split in a number of steps. Every step gives you some marks.
Note 1 At the beginning of the document, please include a check list indicating whether
the below marking points have been implemented successfully. The length of the submitted
document needs to be within 4 pages (A4 paper, 11pt font size).
Note 2 The marking of a functionality will also consider the quality of coding and the quality
of documentation. A run-able implementation alone will have up to 50% of the marks.
functionality f1: 20%
Successfully load the dataset and display the dataset information, including the number of
data entries, the number of classes, the number of data entries for each classes, the minimum
and maximum values for each feature, and the train dataset and test dataset split.
3
functionality f2: 20%
Successfully call library functions to train and save a model. There needs to be a corre-
sponding saved model in your submission.
functionality f3: 40%
You have an implementation of an algorithm that is able to train a model. There needs to
be a corresponding saved model in your submission. You cannot call any library which has
direct implementation of a machine learning algorithm.
functionality f4: 10%
Please output the train and test errors for both models. Each model have 5%.
functionality f5: 10%
Allow users to query the models by changing the input. For example, you can use a variable
to represent the index of the test dataset. Each model have 5%.
4 Deadline and Submission Instructions
• Deadline for submitting the first assignment is given at the beginning of this document.
• Please submit all the files in a single compressed file with the filename
′′〈studentnumber〉.tar′′ or ′′〈studentnumber〉.zip′′
For example, “201191838.tar” or “201191838.zip” if your student number is 201191838.
Submissions with other filename will not be accepted. Also, in the submission files,
please do not include your name.
• Submission is via VITAL Turnitin system.
4
51作业君

Email:51zuoyejun

@gmail.com

添加客服微信: abby12468