This article walks through Gaussian Naive Bayes classification. The focus is on estimating the probability that a data point belongs to each of several classes, rather than on assigning a single hard label. It covers the key concepts, from Bayesian decision theory to Bayes' theorem, and gives a step-by-step implementation using the Iris dataset.
All Posts
The topics covered include Degenerate Dimension, Conformed Dimension, Role-Playing Dimension, Junk Dimension, Outrigger Dimension, and Slowly Changing Dimensions (SCD). The SCD section covers Type 0 through Type 7, each of which handles historical and changing data differently.
In this article, I introduce the Basic Fact table in dimensional data modeling. To understand this technique, we will explore the different types of data modeling and recap some fundamentals, including the star and snowflake schemas and the concept of normalization.
Some people asked me about statistics and probability, and I answered that I had studied only a little and knew very little about statistics. Then they asked, "Aren't these the same thing?" In fact, statistics and probability are distinct from one another.
Probability and statistics are familiar concepts that usually go together, which is why we sometimes mistake them for one and the same. This article clarifies the difference between probability and statistics and gives definitions of the basic concepts in the field.