程序代写案例-BUCI057H7

欢迎使用51辅导,51作业君孵化低价透明的学长辅导平台,服务保持优质,平均费用压低50%以上! 51fudao.top
BUCI057H7 page 2 of 11


Question 1.a (10 marks)
Provost & Fawcett have defined Data Science in terms of 9 computational problems.
How would you introduce Regression to someone who already knows Classification?
Your answer:


BUCI057H7 page 3 of 11


Question 1.b (15 marks)
Explain the geometry that causes the so-called "curse of dimensionality."
Your answer:


BUCI057H7 page 4 of 11


Question 2.a (10 marks)
Over D = {a, b, c, d, e} compute the entropy of the convex frequency distribution:
Pr[X=xi] = [3/8, 3/16, 1/8, 1/8, 3/16].
To simplify calculations, assume the approximation that log2(3) (log in base 2 of 3) equals 1.585.
What is the maximum entropy ever attainable over D?
Your answer:


BUCI057H7 page 5 of 11


Question 2.b (15 marks)
Describe in your words was is meant by a Decision Tree and describe how to extract one from a given
annotated (X, y) dataset (where y is the annotated/dependent variable).
Your answer:


BUCI057H7 page 6 of 11


Question 3.a (10 marks)
What is the goal of Singular-Value Decomposition (SVD)?
How do we obtain the dimension of the matrices that will be generated as the result of its application?
Your answer:


BUCI057H7 page 7 of 11


Question 3.b (15 marks)
Kernelization: for a given activity data matrix D, define the Kernel matrix K(n x n) and describe why it
can be used instead of the original D in the analysis of the properties of the dataset.
Your answer:


BUCI057H7 page 8 of 11


Question 4.a (10 marks)
In the analysis of activity matrices, why are the results of Singular-Value Decomposition not generally
“interpretable?”
Why then are the results of Non-negative Matrix Factorization (NMF) interpretable?
Your answer:

BUCI057H7 page 9 of 11


Question 4.b (15 marks)
Support Vector Machines: consider the figure below that shows data from two classes. Explain the lines.

Are the classes represented in the figure above linearly separable? Propose a general (no need to
determine the exact parameters) kernel function that would be appropriate for this case.
Your answer:


BUCI057H7 page 10 of 11


Question 5.a (10 marks)
Describe a graph representation for the International Trade Network.
Define the clustering coefficient and give an example interpretation for it either in the context of
international commerce or in another network of your liking (which you define in your answer).
Your answer:


BUCI057H7 page 11 of 11


Question 5.b (15 marks)
In networks, what interesting features are captured by the Closeness and Betweenness centrality
measures?
Can you describe a practical example where centrality measures as these could reveal some interesting
property?
Your answer:



欢迎咨询51作业君
51作业君

Email:51zuoyejun

@gmail.com

添加客服微信: abby12468