
Lab DA1

Machine Learning Lab


Name: Suprit Darshan Shrestha
Reg.no:19BCE2584
Data cleaning

Columns with close or similar values were combined to decrease the number of columns; a minimal sketch of this kind of merge is given below.
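The report does not state which columns of Data1.csv were combined or by what rule, so the snippet below is only a hypothetical illustration: the column names and the row-wise mean used here are placeholders, not the exact cleaning step that was applied.

import pandas as pd

# Toy data with made-up values; 'nuc_mean' and 'nuc_avg' stand in for two
# near-duplicate columns of the real dataset.
df = pd.DataFrame({
    'nuc_mean': [0.41, 0.55, 0.47],
    'nuc_avg':  [0.40, 0.57, 0.46],
    'Diagnosis': [0, 1, 0],
})

# Collapse the two similar columns into their row-wise mean and drop the
# originals, reducing the number of columns by one.
df['nuc'] = df[['nuc_mean', 'nuc_avg']].mean(axis=1)
df = df.drop(columns=['nuc_mean', 'nuc_avg'])
print(df)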

Find-S algorithm:
Code:
import csv

# Load the training instances from Data1.csv, skipping the header row.
a = []
with open('Data1.csv', 'r') as csvfile:
    next(csvfile)
    for row in csv.reader(csvfile):
        a.append(row)
print(a)

print("\nThe total number of training instances are : ", len(a))

num_attribute = len(a[0]) - 1

print("\nThe initial hypothesis is : ")
hypothesis = ['0'] * num_attribute
print(hypothesis)

# Find-S: generalise the hypothesis only on positive instances
# (here an instance is positive when its Diagnosis label is "0").
for i in range(0, len(a)):
    if a[i][num_attribute] == "0":
        print("\nInstance ", i + 1, "is", a[i], " and is Positive Instance")
        for j in range(0, num_attribute):
            if hypothesis[j] == '0' or hypothesis[j] == a[i][j]:
                hypothesis[j] = a[i][j]
            else:
                hypothesis[j] = '?'
        print("The hypothesis for the training instance", i + 1, " is: ", hypothesis, "\n")

    if a[i][num_attribute] == "1":
        print("\nInstance ", i + 1, "is", a[i], " and is Negative Instance Hence Ignored")
        print("The hypothesis for the training instance", i + 1, " is: ", hypothesis, "\n")

print("\nThe Maximally specific hypothesis for the training instance is ", hypothesis)
Output:
Decision Tree:
Code:
import pandas as pd

# Load the cleaned dataset; the last column ("Diagnosis") is the target.
dataset = pd.read_csv('Data1.csv')
X = dataset.iloc[:, :-1].values
y = dataset.iloc[:, -1].values

# Hold out 25% of the data for testing.
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)

# Standardise the features (the scaler is fitted on the training split only).
from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)

# Entropy-based (ID3-style) decision tree.
from sklearn.tree import DecisionTreeClassifier
classifier = DecisionTreeClassifier(criterion='entropy', random_state=0)
classifier.fit(X_train, y_train)
# Fitted classifier (remaining parameters are scikit-learn defaults):
# DecisionTreeClassifier(class_weight=None, criterion='entropy', max_depth=None,
#                        max_features=None, max_leaf_nodes=None,
#                        min_samples_leaf=1, min_samples_split=2,
#                        min_weight_fraction_leaf=0.0,
#                        random_state=0, splitter='best')

# Evaluate on the held-out test split.
y_pred = classifier.predict(X_test)
from sklearn import metrics  # Import scikit-learn metrics module for accuracy calculation
print("Accuracy:", metrics.accuracy_score(y_test, y_pred))
Output:

ID3:
Code:
import pandas as pd
import math
import numpy as np

# Load the dataset; every column except "Diagnosis" is a candidate attribute.
data = pd.read_csv('Data1.csv')
features = [feat for feat in data]
features.remove("Diagnosis")

class Node:
    """A node of the ID3 tree: either an internal test on an attribute or a leaf."""
    def __init__(self):
        self.children = []
        self.value = ""
        self.isLeaf = False
        self.pred = ""

def entropy(examples):
    """Entropy of the Diagnosis label over the given examples."""
    pos = 0.0
    neg = 0.0
    for _, row in examples.iterrows():
        if row["Diagnosis"] == 0:
            pos += 1
        else:
            neg += 1
    if pos == 0.0 or neg == 0.0:
        return 0.0
    else:
        p = pos / (pos + neg)
        n = neg / (pos + neg)
        return -(p * math.log(p, 2) + n * math.log(n, 2))

def info_gain(examples, attr):
    """Information gain of splitting the examples on attribute attr."""
    uniq = np.unique(examples[attr])
    print("\n", uniq)
    gain = entropy(examples)
    for u in uniq:
        subdata = examples[examples[attr] == u]
        sub_e = entropy(subdata)
        gain -= (float(len(subdata)) / float(len(examples))) * sub_e
    return gain

def ID3(examples, attrs):
    """Recursively build the tree by splitting on the attribute with maximum gain."""
    root = Node()

    max_gain = 0
    max_feat = ""
    for feature in attrs:
        gain = info_gain(examples, feature)
        if gain > max_gain:
            max_gain = gain
            max_feat = feature
    root.value = max_feat

    uniq = np.unique(examples[max_feat])
    for u in uniq:
        subdata = examples[examples[max_feat] == u]
        if entropy(subdata) == 0.0:
            # Pure subset: create a leaf holding the predicted Diagnosis value.
            newNode = Node()
            newNode.isLeaf = True
            newNode.value = u
            newNode.pred = np.unique(subdata["Diagnosis"])
            root.children.append(newNode)
        else:
            # Impure subset: recurse on the remaining attributes.
            dummyNode = Node()
            dummyNode.value = u
            new_attrs = attrs.copy()
            new_attrs.remove(max_feat)
            child = ID3(subdata, new_attrs)
            dummyNode.children.append(child)
            root.children.append(dummyNode)
    return root

def printTree(root: Node, depth=0):
    """Print the tree with one tab of indentation per level."""
    for i in range(depth):
        print("\t", end="")
    print(root.value, end="")
    if root.isLeaf:
        print(" -> ", root.pred)
    print()
    for child in root.children:
        printTree(child, depth + 1)

root = ID3(data, features)
printTree(root)
Output:
Clean visualization:
Code:
# Feature names for the plot. The target column "Diagnosis" is left out here,
# assuming it is the last column of Data1.csv and therefore not part of X_train.
feature_cols = ['area', 'peri', 'ECC', 'solidity', 'orient_wbc', 'nuc,AVG', 'entropy_cyt', 'AXIS']

from sklearn.tree import DecisionTreeClassifier, export_graphviz
import six
import sys
sys.modules['sklearn.externals.six'] = six  # compatibility shim for code written against older scikit-learn
from sklearn.externals.six import StringIO
from IPython.display import Image
import pydotplus

# Fit a decision tree on the same training split prepared above.
dot_data = StringIO()
clf = DecisionTreeClassifier()
clf = clf.fit(X_train, y_train)

# Export the tree to DOT format and render it as a PNG with pydotplus.
export_graphviz(clf, out_file=dot_data,
                filled=True, rounded=True,
                special_characters=True, feature_names=feature_cols, class_names=['0', '1'])
graph = pydotplus.graph_from_dot_data(dot_data.getvalue())
graph.write_png('diabetes.png')
Image(graph.create_png())
Output:
Conclusion:
Decision trees give a simple yet effective answer to discrimination (classification) problems, and they are one of the few data-analysis approaches whose results can be presented quickly to a non-specialist audience without getting lost in hard mathematical formulas, as the rule listing sketched below illustrates.
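As a small illustration of that explainability, scikit-learn can dump a fitted tree as plain if/else rules with export_text. The sketch below assumes the classifier trained in the Decision Tree section and the feature_cols list from the visualization section are still defined in the same session, with the number of feature names matching the number of columns of X_train.

from sklearn.tree import export_text

# Print the fitted tree as indented if/else rules readable without any maths.
# Assumes `classifier` and `feature_cols` from the sections above are in scope.
rules = export_text(classifier, feature_names=feature_cols)
print(rules)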
