how can i test my model using different dataset in machine learning

Question

Home

how can i test my model using different dataset in machine learning

0

i m new in machine learning and i am create a one small project using CountVectorizer model. i am split my data to 80% -20%. 80% for training model and 20% for testing model. my model work properly run on 20% test data but can i used to test my model on different data set that is similar to training data set?

i am using joblib for dump and load my model.

from joblib import dump, load dump(pipe, filename)  loaded_model = load('filename')

my question is how i directly test my model using different dataset?

Jerrybrandyella Asked on July 16, 2020 in Python.

Share
Comment(0)

Add Comment

1 Answer(s)

Votes
Oldest

0

Yes, you can use the model to test similar datasets.

However, you must keep in mind the preprocessing step according to the model.

When you trained your model, it was trained on a particular dimension and the size of input would have been AxB matric. When you have a new test sentence or new dataset, you must first do the same preprocessing, otherwise, it will throw dimension mismatch errors.

Example:

suppose you have the following count vectorizer object

cv = CountVectorizer()

then you must first fit it on your training dataset, for say

X = dataframe['text_column_name'] X = cv.fit_transform(X) # Fit the Data

Once this is done, whenever you have a new sentence, say

test_sentence = "this is a test sentence"

then you must use the cv object in the following manner

model_input = cv.transform([test_sentence]).toarray()

and then you can make predictions:

model.predict(model_input)

This method must be followed even if you wish to test a new dataset which is in a data frame or some other file format.

Sherwoodrubenelinor Answered on July 16, 2020.

Share
Comment(0)

Add Comment

Your Answer

Answer 1

BuddyPress is a plugin for WordPress that enables you to create a social network or community website. It has all the...

Answer 2

I value you getting some margin to help me with this task. Without you, no part of this would have...

Answer 3

Try to define a Cohesive class, until and unless the methods are written relevant to the class and it defines...

Answer 4

Try to add exportAllData: true, as an other option, hope it helps :)

Answer 5

DataSet can read an XML, infer schema and create a tabular representation that's easy to manipulate: DataSet ip1 = new...

Answer 6

I created a class and used Xml Linq : using System; using System.Collections.Generic; using System.Linq; using System.Text; using System.Xml; using...

Answer 7

XDocument first = XDocument.Load(args[0]); XDocument second = XDocument.Load(args[1]); var result = new XElement( "ipaddresses", first.Root.Elements("ip") .Zip(second.Root.Elements("ip"), (f, s) => {...

Answer 8

Following your code for the header row, you could achieve this by an <xsl:apply-templates select="/report/order_actions/order_action[order_id = current()/order_id]" /> As well...

Answer 9

BuddyPress is a plugin for WordPress that enables you to create a social network or community website. It has all the...

Answer 10

I value you getting some margin to help me with this task. Without you, no part of this would have...

Answer 11

Try to define a Cohesive class, until and unless the methods are written relevant to the class and it defines...

Answer 12

Try to add exportAllData: true, as an other option, hope it helps :)

Answer 13

DataSet can read an XML, infer schema and create a tabular representation that's easy to manipulate: DataSet ip1 = new...

Answer 14

I created a class and used Xml Linq : using System; using System.Collections.Generic; using System.Linq; using System.Text; using System.Xml; using...

Answer 15

XDocument first = XDocument.Load(args[0]); XDocument second = XDocument.Load(args[1]); var result = new XElement( "ipaddresses", first.Root.Elements("ip") .Zip(second.Root.Elements("ip"), (f, s) => {...

Answer 16

Following your code for the header row, you could achieve this by an <xsl:apply-templates select="/report/order_actions/order_action[order_id = current()/order_id]" /> As well...

LATEST ANSWERS

how can i test my model using different dataset in machine learning

Your Answer

TOP USERS

HOT QUESTIONS

LATEST ANSWERS

how can i test my model using different dataset in machine learning

Your Answer

Tags Widget

TOP USERS

HOT QUESTIONS