Week 4 Solution PDS

Uploaded by

Netaji Gandi


NPTEL-PYTHON FOR DATA SCIENCE

ASSIGNMENT-4-SOLUTION

1. Answer: B: pandas.get_dummies()
• This function one-hot encodes categorical variables: each category of a
variable is added as a new indicator (dummy) column in the dataframe.
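As a minimal sketch of the encoding described above, the following uses a small made-up dataframe. The column names and values are hypothetical and are not taken from the assignment data.

```python
import pandas as pd

# Hypothetical data; column names and values are illustrative only.
df = pd.DataFrame({
    "department": ["sales", "hr", "sales", "it"],
    "salary": ["low", "high", "medium", "low"],
})

# Each category of each listed column becomes its own indicator column,
# named <column>_<category>.
encoded = pd.get_dummies(df, columns=["department", "salary"])
print(encoded.columns.tolist())
```

Three departments and three salary levels give six indicator columns, and the two original columns are dropped.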

2. Answer: D: The three key benefits of performing feature selection on your data are:
• Reduces overfitting: less redundant data means fewer errors due to noise
• Improves accuracy: removing redundant data can improve accuracy
• Reduces training time: less data means that algorithms train faster

3. Answer: C: sklearn.model_selection.train_test_split()
• The dataset is usually split into training data and test data. The model learns from
the training data, and the held-out test data is used to evaluate the model’s predictions.
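A short sketch of such a split, assuming scikit-learn is installed. The arrays are made up, and the 70/30 ratio and the random_state value are arbitrary choices for illustration.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical data: 10 samples with 2 features each, and binary labels.
X = np.arange(20).reshape(10, 2)
y = np.array([0, 1] * 5)

# Hold out 30% of the rows for testing; random_state makes the split repeatable.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)
print(X_train.shape, X_test.shape)
```

With 10 rows and test_size=0.3, three rows go to the test set and seven to the training set.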
4. Answer: B
• k is the number of nearest neighbours used to predict the class

5. Answer: C: sklearn.neighbors.KNeighborsClassifier()
• The sklearn library provides a layer of abstraction on top of Python, so the
algorithm does not have to be implemented by hand
• Therefore, to make use of the KNN algorithm, it is sufficient to create an
instance of KNeighborsClassifier.
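The following sketch shows one way to create and use such an instance, assuming scikit-learn is installed. The toy data and the choice of k = 3 (n_neighbors=3) are assumptions made for illustration only.

```python
from sklearn.neighbors import KNeighborsClassifier

# Hypothetical one-feature data forming two well-separated groups.
X = [[0.0], [0.2], [0.4], [5.0], [5.2], [5.4]]
y = [0, 0, 0, 1, 1, 1]

# n_neighbors is k: the number of nearest neighbours that vote on the class.
knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X, y)

# Each query point takes the majority class of its 3 nearest training points.
pred = knn.predict([[0.1], [5.1]])
print(pred)
```

The point 0.1 has only class-0 points among its three nearest neighbours, and 5.1 has only class-1 points, so the predictions are 0 and 1 respectively.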

6. Answer: A
A plot of a model’s standardized residuals against its predicted values is
called a residual plot. When the variance of the residuals is not constant
across the predicted values, the condition is called heteroscedasticity.
7. Answer: B
R-squared is the percentage of the response variable variation that is explained by
a linear model. For a least-squares fit with an intercept, R-squared is always
between 0 and 1, where:
o 0 indicates that the model explains none of the variability of the response
variable.
o 1 indicates that the model explains all of the variability of the response
variable.
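The two endpoints can be checked with a small sketch that computes R-squared as 1 minus the ratio of the residual sum of squares to the total sum of squares. The helper name r_squared and the sample values are hypothetical.

```python
def r_squared(y_true, y_pred):
    """Return 1 - SS_res / SS_tot for paired observations and predictions."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot

y = [1.0, 2.0, 3.0, 4.0]

# Predictions that match the observations explain all of the variability.
perfect = r_squared(y, y)

# Predicting the mean for every observation explains none of it.
baseline = r_squared(y, [2.5] * 4)

print(perfect, baseline)
```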
8. Answer: A
• The numbers of correct and incorrect predictions are summarized as count
values, as in a confusion matrix
• The number of participants who have been wrongly classified as female is 15
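As a rough illustration of how such counts are obtained, the sketch below tallies pairs of actual and predicted labels. The labels are made up and are unrelated to the assignment data, so the counts differ from the 15 reported above.

```python
from collections import Counter

# Hypothetical labels, for illustration only.
actual    = ["male", "female", "female", "male", "male", "female"]
predicted = ["male", "female", "male", "female", "male", "female"]

# Count each (actual, predicted) pair; these are the confusion matrix cells.
counts = Counter(zip(actual, predicted))

# Participants who are actually male but were predicted to be female.
wrongly_female = counts[("male", "female")]

# Correct predictions lie on the diagonal of the matrix.
correct = counts[("male", "male")] + counts[("female", "female")]

print(wrongly_female, correct)
```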
9. Answer: D
• The Akaike information criterion (AIC) is an estimator of the relative quality of
statistical models for a given set of data
• Thus, AIC provides a means for model selection: among candidate models fitted
to the same data, the one with the lowest AIC is preferred
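A minimal sketch of model selection with the standard definition AIC = 2k - 2 ln(L), where k is the number of estimated parameters and L is the maximized likelihood. The log-likelihood values and parameter counts below are hypothetical.

```python
def aic(log_likelihood, n_params):
    """Akaike information criterion: 2k - 2 ln(L)."""
    return 2 * n_params - 2 * log_likelihood

# Hypothetical fits of two candidate models to the same data.
model_a = aic(log_likelihood=-120.0, n_params=3)
model_b = aic(log_likelihood=-118.5, n_params=6)

# The model with the lower AIC is preferred.
best = "A" if model_a < model_b else "B"
print(model_a, model_b, best)
```

Here model B fits slightly better, but its three extra parameters are penalized, so model A has the lower AIC.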
10. Answer: D
• Maximum likelihood provides the values of β0 and β1 that maximize the
probability of observing the values of the dependent variable in the data
• We use the log-likelihood function to express the probability of observing the
dependent variable, given the unknown parameters (β0 and β1)
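The sketch below evaluates a log-likelihood for two candidate parameter pairs. It assumes a logistic regression model with one predictor, which the answer above does not state explicitly. The data are made up.

```python
import math

def log_likelihood(b0, b1, xs, ys):
    """Log-likelihood of binary outcomes ys under a logistic model (assumed)."""
    total = 0.0
    for x, y in zip(xs, ys):
        p = 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))  # P(y = 1 | x)
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return total

# Hypothetical data: larger x values go with outcome 1.
xs = [-2.0, -1.0, 1.0, 2.0]
ys = [0, 0, 1, 1]

ll_flat = log_likelihood(0.0, 0.0, xs, ys)  # every probability is 0.5
ll_fit = log_likelihood(0.0, 1.0, xs, ys)   # slope follows the pattern in the data

print(ll_flat, ll_fit)
```

The second parameter pair yields the higher log-likelihood. A maximum likelihood routine searches for the pair with the highest value.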
11. Answer: A

• The Gini index ranges between 0 and 1, where 0 denotes that all
elements belong to one class and values closer to 1 denote that the elements
are randomly distributed across various classes
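As a small worked example, the Gini index of a set of labels can be computed as 1 minus the sum of squared class proportions. The helper name gini and the labels are hypothetical.

```python
from collections import Counter

def gini(labels):
    """Gini index: 1 - sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())

pure = gini(["a", "a", "a", "a"])   # all elements in one class
mixed = gini(["a", "a", "b", "b"])  # evenly split across two classes

print(pure, mixed)
```

With k equally likely classes the index equals 1 - 1/k, so an even two-class split gives 0.5, and the value approaches 1 only as the number of classes grows.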
Use the following code to import your data, and then proceed
with the questions:

12. INPUT

OUTPUT

INFERENCE: Answer: D
None of the variables in the data has missing values.
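The original input for this question is not reproduced here. One common way to run such a check, assuming pandas, is to count the missing values per column, as sketched on a small made-up dataframe (the column satisfactionLevel and all values are hypothetical).

```python
import pandas as pd

# Hypothetical rows; not the assignment data.
df = pd.DataFrame({
    "satisfactionLevel": [0.38, 0.80, 0.11],
    "lastEvaluation": [0.53, 0.86, 0.88],
})

# isnull() flags missing cells; sum() counts them per column.
missing_per_column = df.isnull().sum()
print(missing_per_column)
```

A count of zero for every column means that no variable has missing values.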
13. INPUT:
OUTPUT:

INFERENCE: Answer: B
The third quartile for the variable “lastEvaluation” is 0.87.
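The original input is not reproduced here. As a sketch, assuming pandas, the third quartile of a column can be read from describe() or computed with quantile(0.75). The values below are made up, so the result differs from the 0.87 reported above.

```python
import pandas as pd

# Hypothetical values for the column; not the assignment data.
df = pd.DataFrame({"lastEvaluation": [0.5, 0.6, 0.7, 0.8, 0.9]})

# The 75% row of describe() is the third quartile.
q3_from_describe = df["lastEvaluation"].describe().loc["75%"]

# quantile(0.75) computes the same value directly.
q3 = df["lastEvaluation"].quantile(0.75)

print(q3_from_describe, q3)
```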
14. INPUT:

OUTPUT:

INFERENCE: Answer: C
The “SALES” department has the highest frequency in the low salary category.
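The original input is not reproduced here. A frequency table of this kind can be built with pandas.crosstab, as in the sketch below. The column names dept and salary and all rows are hypothetical.

```python
import pandas as pd

# Hypothetical rows; not the assignment data.
df = pd.DataFrame({
    "dept": ["sales", "sales", "hr", "it", "sales", "hr"],
    "salary": ["low", "low", "low", "high", "medium", "high"],
})

# Rows are departments, columns are salary categories, cells are counts.
table = pd.crosstab(df["dept"], df["salary"])

# Department with the highest count in the "low" column.
top = table["low"].idxmax()
print(table)
print(top)
```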
15. INPUT:

OUTPUT:

INFERENCE: Answer: B
From the above plot we can see that the median value of “numberOfProjects”, the number
of projects the employees have worked on, is 4.
16. & 17: INPUT:
OUTPUT:

INFERENCE: Answer for Q16: A and Answer for Q17: D

The accuracy of our model is 80% and the number of misclassified samples is 745.
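The original input is not reproduced here. As a sketch of how the two quantities relate, the misclassified count is the number of predictions that differ from the true labels, and accuracy is the remaining fraction. The labels below are made up, so the counts do not match the assignment output.

```python
# Hypothetical true and predicted labels for 10 test samples.
y_test = [0, 1, 1, 0, 1, 0, 0, 1, 0, 1]
y_pred = [0, 1, 0, 0, 1, 0, 1, 1, 0, 1]

# Misclassified samples are those where prediction and truth disagree.
misclassified = sum(t != p for t, p in zip(y_test, y_pred))

# Accuracy is the fraction of samples classified correctly.
accuracy = 1 - misclassified / len(y_test)

print(misclassified, accuracy)
```

Two of the ten hypothetical predictions are wrong, which corresponds to an accuracy of 80%.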
18. INPUT:
OUTPUT:

INFERENCE: Answer: C
From the plot we can see that the number of employees who worked for 150 hours per
month is above 2500.

19. INPUT:
OUTPUT:

INFERENCE: Answer: A
The accuracy score of the predicted model is 95%.

20. INPUT:
OUTPUT:

INFERENCE: Answer: C
From the plot we can see that the performance level of employees who have worked on
two projects is low, not high.
