Adam White asked . 2022-04-21

fitrsvm fails if epsilon is generated using a for loop

I wanted to run a grid search to find suitable parameters for my SVM model but I have discovered that fitrsvm gives inconsistent errors if the value of the epsilon parameter is generated using a ‘for loop’. For example the RMSE for my model with epsilon = 0.8 will be different if I use the for loop:
 
for epsilon = 0.8:.1:1.2
 
compared with if I use the for loop
 
for epsilon = 0.1:.1:1.2
 
The RMSEs are 2.6868 and 2.7020 respectively
 
I thought this might be some floating point error, so I tried to ensure that the epsilon value passed to fitrsvm was exactly 0.8. I did this by creating variable d_epsilon (line 17) and passing its value to fitrsvm (ie by changing line 26 to ‘Epsilon’ = d_epsilon but this did not work. By contrast using c_epsilon which is completely independent of the for loop (line 16) does work.
 
In my real project, I use nested loops to search for values for Epsilon, Boxconstraint, and KernelScale. The inconsistencies in my results are about 10%. (I am using a grid search as the parameters returned using OptimizeHyperparameters perform worse that some of the parameters cited in journal articles for my dataset (UCI’s auto-mpg).
 
 
clear all
%%read in auto-mpg.csv. This is a cleaned version of UCI dataset auto-mpg
data = readtable('auto-mpg.csv','ReadVariableNames',false);
VarNames = {'mpg','cylinders' 'displacement' 'horsepower' 'weight' 'acceleration' ...
    'modelYear' 'origin' 'carName'};
data.Properties.VariableNames = VarNames;
data = [data(:,2:9) data(:,1)];
data.carName=[];

%%carry out 10 fold cross-validation with different epsilon values
testResults_SVM=[];
testActual_SVM=[];
rng('default')
c = cvpartition(data.mpg,'KFold',10);
for epsilon = 0.1:0.1:1.2
    %c_epsilon= 0.80000;
    %d_epsilon = str2double(string(round(epsilon,2)))
    for fold = 1:10
        cv_trainingData = data(c.training(fold), :);   
        cv_testData = data(c.test(fold), :);
        AutoSVM = fitrsvm(cv_trainingData,'mpg',...
                'KernelFunction', 'gaussian', ...
                'PolynomialOrder', [], ...
                'KernelScale', 5.5, ...
                'BoxConstraint', 100, ...
                'Epsilon', epsilon, ...
                'Standardize', true);
        convergenceChk(fold)=AutoSVM.ConvergenceInfo.Converged; 
        testResults_SVM=[testResults_SVM;predict(AutoSVM,cv_testData)];
        testActual_SVM=[testActual_SVM;cv_testData.mpg];
     end
  %%generate summary statistics and plots
    residual_SVM = testResults_SVM-testActual_SVM;
    AutoMSE_SVM=((sum((residual_SVM).^2))/size(testResults_SVM,1));
    AutoRMSE_SVM = sqrt(AutoMSE_SVM);
    if round(epsilon,4) == 0.8
        AutoRMSE_SVM
    end    
end

 

parameter fitting , AI, Data Science, and Statistics , Statistics and Machine Learning Toolbox ,

Expert Answer

Prashant Kumar answered . 2024-05-06 13:41:02

Can we simplify things a bit? Here's a version of your code that uses built-in validation instead of explicit loops.
 
The first loop below uses the range .1:.1:1.2. The second uses .8:.1:1.2, and the third uses the values .1,.2,...,1.2 individually.
 
In all 3 cases the cross-validation loss of the SVM is exactly the same. Notice that this is true even though there is roundoff error in the epsilons calculated in the first loop compared to the individual values in the last loop. So the SVM fitting is robust to tiny differences in epsilon (on the order of 1e-15 here).
 
So it doesn't look like SVM has a problem with epsilons generated in a loop.
 
 
clear all
%%read in auto-mpg.csv. This is a cleaned version of UCI dataset auto-mpg
data = readtable('auto-mpg.csv','ReadVariableNames',false);
VarNames = {'mpg','cylinders' 'displacement' 'horsepower' 'weight' 'acceleration' ...
    'modelYear' 'origin' 'carName'};
data.Properties.VariableNames = VarNames;
data = [data(:,2:9) data(:,1)];
data.carName=[];

rng('default')
c = cvpartition(data.mpg,'KFold',10);
LossesLoop1 = [];
LossesLoop2 = zeros(1,7);
LossesIndividual = [];
for epsilon = 0.1:0.1:1.2
    AutoSVM = fitrsvm(data,'mpg',...
        'CVPartition', c,...
        'KernelFunction', 'gaussian', ...
        'PolynomialOrder', [], ...
        'KernelScale', 5.5, ...
        'BoxConstraint', 100, ...
        'Epsilon', epsilon, ...
        'Standardize', true);
    LossesLoop1(end+1) = kfoldLoss(AutoSVM);
end

for epsilon = 0.8:0.1:1.2
    AutoSVM = fitrsvm(data,'mpg',...
        'CVPartition', c,...
        'KernelFunction', 'gaussian', ...
        'PolynomialOrder', [], ...
        'KernelScale', 5.5, ...
        'BoxConstraint', 100, ...
        'Epsilon', epsilon, ...
        'Standardize', true);
    LossesLoop2(end+1) = kfoldLoss(AutoSVM);
end

for epsilon = [.1 .2 .3 .4 .5 .6 .7 .8 .9 1 1.1 1.2]
    AutoSVM = fitrsvm(data,'mpg',...
        'CVPartition', c,...
        'KernelFunction', 'gaussian', ...
        'PolynomialOrder', [], ...
        'KernelScale', 5.5, ...
        'BoxConstraint', 100, ...
        'Epsilon', epsilon, ...
        'Standardize', true);
    LossesIndividual(end+1) = kfoldLoss(AutoSVM);
end

LossesLoop1
LossesLoop2
LossesIndividual

isequal(LossesLoop1, LossesIndividual)
isequal(LossesLoop2(8:end), LossesIndividual(8:end))

isequal(.1:.1:1.2, [.1 .2 .3 .4 .5 .6 .7 .8 .9 1 1.1 1.2])

[.1:.1:1.2] - [.1 .2 .3 .4 .5 .6 .7 .8 .9 1 1.1 1.2]

 


Not satisfied with the answer ?? ASK NOW

Frequently Asked Questions

MATLAB offers tools for real-time AI applications, including Simulink for modeling and simulation. It can be used for developing algorithms and control systems for autonomous vehicles, robots, and other real-time AI systems.

MATLAB Online™ provides access to MATLAB® from your web browser. With MATLAB Online, your files are stored on MATLAB Drive™ and are available wherever you go. MATLAB Drive Connector synchronizes your files between your computers and MATLAB Online, providing offline access and eliminating the need to manually upload or download files. You can also run your files from the convenience of your smartphone or tablet by connecting to MathWorks® Cloud through the MATLAB Mobile™ app.

Yes, MATLAB provides tools and frameworks for deep learning, including the Deep Learning Toolbox. You can use MATLAB for tasks like building and training neural networks, image classification, and natural language processing.

MATLAB and Python are both popular choices for AI development. MATLAB is known for its ease of use in mathematical computations and its extensive toolbox for AI and machine learning. Python, on the other hand, has a vast ecosystem of libraries like TensorFlow and PyTorch. The choice depends on your preferences and project requirements.

You can find support, discussion forums, and a community of MATLAB users on the MATLAB website, Matlansolutions forums, and other AI-related online communities. Remember that MATLAB's capabilities in AI and machine learning continue to evolve, so staying updated with the latest features and resources is essential for effective AI development using MATLAB.

Without any hesitation the answer to this question is NO. The service we offer is 100% legal, legitimate and won't make you a cheater. Read and discover exactly what an essay writing service is and how when used correctly, is a valuable teaching aid and no more akin to cheating than a tutor's 'model essay' or the many published essay guides available from your local book shop. You should use the work as a reference and should not hand over the exact copy of it.

Matlabsolutions.com provides guaranteed satisfaction with a commitment to complete the work within time. Combined with our meticulous work ethics and extensive domain experience, We are the ideal partner for all your homework/assignment needs. We pledge to provide 24*7 support to dissolve all your academic doubts. We are composed of 300+ esteemed Matlab and other experts who have been empanelled after extensive research and quality check.

Matlabsolutions.com provides undivided attention to each Matlab assignment order with a methodical approach to solution. Our network span is not restricted to US, UK and Australia rather extends to countries like Singapore, Canada and UAE. Our Matlab assignment help services include Image Processing Assignments, Electrical Engineering Assignments, Matlab homework help, Matlab Research Paper help, Matlab Simulink help. Get your work done at the best price in industry.