A Influência do tamanho da amostra na credibilidade de testes estatísticos não-paramétricos: uma abordagem com simulações de Monte Carlo

Authors

DOI:

https://doi.org/10.34019/2179-3700.2024.v24.46207

Keywords:

Tamanho da Amostra, Testes Não-Paramétricos, Teste de Razão de Verossimilhanças, Teste de Pearson, Monte Carlo

Abstract

Statistics is fundamental in various scientific research. It plays a vital role by enabling the understanding of potential patterns and relationships present in the data, allowing researchers to draw meaningful conclusions and base their findings on solid evidence. The correct application of statistics ensures that the collected information is transformed into reliable and useful knowledge. However, its success largely depends on a key factor: sample size. Thus, the aim of this research project is to investigate how sample size impacts the decision of two of the main non-parametric statistical tests: the likelihood ratio and Pearson's tests. To achieve this objective, the project applied the Monte Carlo method, which allowed for the evaluation of the rejection rates of both tests in different scenarios. The variation in sample size revealed that, as the sample size increases, rejection rates become more consistent, approaching the ideal value of 5%. This suggests that larger sample sizes result in more reliable tests. The research also compared the likelihood ratio and Pearson's tests, concluding that both are equivalent, as they have similar rejection rates. Therefore, either test can be chosen for future analyses, as both are equally effective. In summary, this project highlighted the importance of sample size in non-parametric tests. As sample size increases, the results obtained from these two tests can be considered more reliable, making them more robust for analysis. These conclusions have significant implications for research and informed decision-making in various fields.

Downloads

Download data is not yet available.

References

KROESE, D. P.; BRERETON, T.; TAIMRE, T.; BOTEV, Z. I. Why the Monte Carlo method is so important today. WIREs Computational Statistics, v. 6, n. 6, p. 386-392, 2014.

LONGJOHN, R.; SMYTH, P.; STERN, H. S. Likelihood ratios for categorical count data with applications in digital forensics. Law, Probability and Risk, v. 21, n. 2, p. 91-122, 2022.

PEARSON, K. On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science (Series 5), v. 50, n. 302, p. 157-175, 1900.

R CORE TEAM. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing, 2022.

TAYAR, D. O.; RIBEIRO JR., U.; CECCONELLO, I.; MAGALHÃES, T. M.; SIMÕES, C.; AULER JR., J. O. C. Propensity score matching comparison of laparoscopic versus open surgery for rectal cancer in a middle-income country: short-term outcomes and cost analysis. ClinicoEconomics and Outcomes Research, v. 10, p. 521-527, 2018.

WICKHAM, H. ggplot2: Elegant Graphics for Data Analysis. New York: Spring-Verlag, 2016.

WILKS, S. S. The large-sample distribution of likelihood ratio for testing composite hypotheses. The Annals of Mathematical Statistics, v. 9, n. 1, p. 60-62, 1938.

Published

2025-04-09

How to Cite

Magalhães, T. M., & Mendes, G. P. A. (2025). A Influência do tamanho da amostra na credibilidade de testes estatísticos não-paramétricos: uma abordagem com simulações de Monte Carlo. Principia: Caminhos Da Iniciação Científica, 24. https://doi.org/10.34019/2179-3700.2024.v24.46207

Issue

Section

Artigos originais - Ciências Exatas e da Terra