hamidreza hamidi asked . 2022-04-13

Working with the Kolmogorov test

Hi, I am trying to use the Kolmogorov test, which I am going to use in my article. I generated a data set A, then randomly drew a sample set from A. Then I wanted to compare these two sample sets with kstest, but it showed me that they don't have the same distribution.
 
Here is my simple code:
 
clc
clear all
close all

n_s = 1000;
mother_random_variable = lognrnd(0.3,0.5,[1,100000]);               %data lognormal
S = mother_random_variable(randi(numel(mother_random_variable),1,n_s))          %sample

S_y = [S]';                             %selected data 

S_mean=mean(S_y);               %sample mean
S_var=std(S_y);                 %sample standard deviation (despite the name)
test_cdf = [S_y,cdf('Lognormal',S_y,S_var,S_mean)];        %make cdf 
kstest(S_y,'CDF',test_cdf)                  %kstest
plot(sort(S_y),logncdf(sort(S_y)),'r--')
hold on
cdfplot(S_y)

They should have the same distribution, so this is a strange result. I found an even stranger result when I compared my data set with itself: the result shows that they don't have the same distribution.

clc
clear all
close all

n_s = 1000;
mother_random_variable = lognrnd(0.3,0.5,[1,100000]); %data
S=mother_random_variable; % I named data with S for simpler code
S_y = [S]';     %selected data 
S_mean=mean(S_y);
S_var=std(S_y);
test_cdf = [S_y,cdf('Lognormal',S_y,S_var,S_mean)];
kstest(S_y,'CDF',test_cdf)
plot(sort(S_y),logncdf(sort(S_y)),'r--')
hold on
cdfplot(S_y)

Do you have any idea?


Expert Answer

Prashant Kumar answered . 2024-04-26 10:35:21

Having only looked at your 2nd block of code, I have some comments and suggestions.
 
1) The parameters for a lognormal distribution are mu and sigma, in that order, where mu and sigma are the mean and standard deviation of the logarithm of the data. In your code, you're entering them in reverse when you call the cdf() function, which creates a totally different distribution from the one you intend.
 
 
y = cdf('Lognormal', S_y, S_var, S_mean);    % your code, incorrect
y = cdf('Lognormal', S_y, S_mean, S_var);    % correct

2) This is just a suggestion but it's a bit cleaner to use the makedist() function rather than entering the parameters manually into cdf().

doc cdf

pd = makedist('Lognormal', 'mu', S_mean, 'sigma', S_var); 
y = cdf(pd, S_y);   % instead of cdf('Lognormal', S_y, S_mean, S_var)                  

3) "when I compare my data set with itself, Its result shows me they don't have same distribution." But you aren't comparing your data with itself. You're comparing your data with a hypothesized CDF that you computed from parameters that do not describe your data. The code below plots the distribution of values from your data (top) and the distribution of the CDF values (bottom). Clearly those distributions differ, and kstest() correctly rejects the null hypothesis.

figure
subplot(2,1,1)
histogram(S_y)
title('mother random variable')
subplot(2,1,2)
histogram(cdf('Lognormal', S_y, S_mean, S_var))
title('CDF distribution')

4) Separately from the points above, you are using different parameters to create the "mother_random_variable" and the cdf() data. For the random variables you pass (0.3, 0.5) to lognrnd(), which are mu and sigma of the logarithm of the data, but for the cdf you're using the mean and std of the data itself, which are ~(1.5, 0.8).
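
Putting points 1, 2 and 4 together, here is a minimal sketch of one way the test could be set up. It assumes the goal is to check a random subsample against a lognormal hypothesis and against the mother data set. The names A, mu_hat, sigma_hat, h1, p1, h2 and p2 are illustrative and not from the original code, and the fixed seed is only there to make the run repeatable.

```matlab
rng(0);                                   % fixed seed (assumption, for repeatability)
mu = 0.3; sigma = 0.5;                    % parameters of log(data), as in the question
A = lognrnd(mu, sigma, [100000, 1]);      % "mother" data set
S_y = A(randi(numel(A), 1000, 1));        % random subsample of A (with replacement)

% Estimate the lognormal parameters on the log scale,
% not from mean(S_y) and std(S_y).
mu_hat    = mean(log(S_y));
sigma_hat = std(log(S_y));

% Hypothesized distribution as a probability distribution object.
pd = makedist('Lognormal', 'mu', mu_hat, 'sigma', sigma_hat);

% One-sample test: h1 = 0 means the lognormal hypothesis is not rejected at 5%.
[h1, p1] = kstest(S_y, 'CDF', pd);

% Two-sample test: compares the subsample directly with the mother data set.
[h2, p2] = kstest2(S_y, A);

fprintf('kstest : h = %d, p = %.3f\n', h1, p1);
fprintf('kstest2: h = %d, p = %.3f\n', h2, p2);
```

With parameters estimated this way, both tests would typically return h = 0, although the exact p-values depend on the random draw. One caveat: kstest() assumes the hypothesized distribution is fully specified in advance, so estimating mu and sigma from the same sample makes the test conservative. Running lillietest(log(S_y)) may be a more appropriate alternative in that case.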

