Szabó László István író, költő, az informatika tudományok tanára: AI machine learning naiv Bayes osztályozó

2026. június 12., péntek

AI machine learning naiv Bayes osztályozó

Ez a kód tökéletesen bemutatja az AI logikáját: van benne tanítási fázis (adatok beolvasása) és következtetési/predikciós fázis (döntéshozatal a tanult statisztikák alapján).a gépi tanulás (Machine Learning) alapjainak kézzel történő leprogramozása.Az alábbiakban egy egyszerű, mégis látványos példát mutatok: egy Naiv Bayes osztályozót, amely képes megtanulni és eldönteni szövegekről (pl. emailekről), hogy azok spamek-e vagy sem. Megszámolja a szavak előfordulási gyakoriságát, és Bayes tétele alapján számol valószínűséget.

------------------------

import math

import re

from collections import defaultdict

class NaiveBayesClassifier:

def __init__(self):

# Osztályok gyakorisága (spam / nem_spam)

self.class_counts = defaultdict(int)

# Szavak gyakorisága osztályonként: {osztály: {szó: darabszám}}

self.vocab_counts = defaultdict(lambda: defaultdict(int))

# Összes szó az adott osztályban

self.class_word_totals = defaultdict(int)

# Egyedi szavak halmaza

self.vocabulary = set()

def tokenize(self, text):

# Kisbetűsítés és szavakra bontás

return re.findall(r"\b\w+\b", text.lower())

def train(self, documents):

# documents egy lista: (szöveg, címke)

for text, label in documents:

self.class_counts[label] += 1

words = self.tokenize(text)

for word in words:

self.vocab_counts[label][word] += 1

self.class_word_totals[label] += 1

self.vocabulary.add(word)

def _calculate_prob(self, word, label):

# Laplace-simítás (hogy a 0-szor előforduló szavak ne nullázzák le a valószínűséget)

count = self.vocab_counts[label][word] + 1

total = self.class_word_totals[label] + len(self.vocabulary)

return math.log(count / total)

def predict(self, text):

words = self.tokenize(text)

predictions = {}

for label in self.class_counts:

# Kezdőérték az osztály relatív gyakorisága alapján

log_prob = math.log(

self.class_counts[label]

/ sum(self.class_counts.values())

)

# Szavak valószínűségének összeadása (logaritmus miatt)

for word in words:

if word in self.vocabulary:

log_prob += self._calculate_prob(word, label)

predictions[label] = log_prob

# Legnagyobb valószínűségű osztály kiválasztása

return max(predictions, key=predictions.get)

# --- Példa használat ---

# 1. Tanító adathalmaz (címkék: 'spam', 'normal')

training_data = [

("nyerj egy uj telefont ingyen", "spam"),

("kérlek hivj fel sürgősen", "spam"),

("ingyen penz sorsolas", "spam"),

("szia, holnap találkozunk a megbeszélésen?", "normal"),

("kérlek küldd el a dokumentumot", "normal"),

("ebédeljünk együtt?", "normal"),

]

# 2. Modell példányosítása és tanítása

model = NaiveBayesClassifier()

model.train(training_data)

# 3. Tesztelés

test_text_1 = "ingyen sorsolas telefon nyeremeny"

test_text_2 = "küldd el a dokumentumot légyszi"

print(f"'{test_text_1}' -> {model.predict(test_text_1)}")

print(f"'{test_text_2}' -> {model.predict(test_text_2)}")

---------------

'ingyen sorsolas telefon nyeremeny' -> spam

'küldd el a dokumentumot légyszi' -> normal

** Process exited - Return Code: 0 **

-------------------

Nincsenek megjegyzések:

Megjegyzés küldése

Curiculum vitea

Autobiography

Personal data:

Name: Laszlo Istvan Szabo

Email: szlip964@google.com

Age: 57 years

Education :

University College of Nyiregyhaza; degree Nyíregyhaza 2015 NYE University College of Nyiregyhaza; Education attained University education (Bachelor's degree) 2001 – 2005 NYF Natural Philosophy, Nyíregyhaza Field of study: information teacher.
1978 - 1982 Váry Emil Grammar School, Demecser Field of study: laboratory technician

Courses and training :

1997 - 1998, NYRMKK Name of course/training:

Publication Editor Certificate: OKJ superlative hour 800

1999 - 2000, NYRMKK Name of course/training: Computer mechanic

Certificate: OKJ superlative hour 900

2005 - 2006, Interdidact Name of course/training: information specialist

Certificate: OKJ superlative hour 1400

2010, Comp-School Name of course/training: project manager specialist

Certificate: basic hour 176
Employment history 2017 Wesselenyi secondary school Nyíregyháza

Certificate: basic hour 60
Employment history 2000 Inczédy secondary school Nyíregyháza

Job position: information teacher

Job description:

For the library's informatics system the development of the administrative system of his development and his extension, the institution system maintenance archiving, system supervision, for users' work the development of his support, his education, safety technology solutions, informatics professional developments, educations

Skills:

Language skills: Hungarian –native language

English - basic on a level

Russian - basic on a level

Administrative and economic skills:

Cash register - basic on a level

Human Resources - basic on a level

Typing - basic on a level

Warehouse management - basic on a level

Computer skills - user:

Internet (e-mail, www) - expert

Microsoft Word - expert

UNIX/Linux - expert

CorelDRAW - expert

Microsoft Windows - expert

Computer skills - programmer:

MySQL - expert

Borland Delphi - expert

programming languages;

C/C++ - expert

CSharp - expert

Pascal - expert

Fortran - expert

Cobol - expert

Python - expert

Perl - expert

Microsoft Visual Basic - expert

Net -expert

Computer skills - administrator:

LAN/WAN administration - expert

Windows 2016 server administration - expert

UNIX/Linux administration - expert

OpenVMS administration - begin

Driving licence:

1982. B; 56000 km

Place of work: Nyíregyháza, information technology,teacher

Adequacy:

To some extent I wrote this blog for
my friends. They can get to know each
from my thoughts. In addition I
wrote this blog for my thousands of students in
the Nyíregyháza where I work as
IT teacher. They can
use it...