XGBoost in Python not giving consistent model performance
I have been having the same problem with multiple datasets. I trained XGBoost models in Python on different datasets and tuned hyperparameters with random search. The problem is that if I change the random seed, the model's out-of-sample behavior changes; sometimes recall shifts from 40% to 60% on the same dataset. One complication is that my datasets are always small (fewer than a few thousand records). I have tried cross-validation with different numbers of folds (5 to 20) and a wide range for the parameters I tune, but the problem persists.
I wonder what other options there are to get a more consistent model. My dataset is pretty balanced, so I don't think undersampling/oversampling would help. I also tried a grid search rather than random search, but then I had to use a smaller range for the parameters so that training wouldn't take too long.
Thanks for the help