Chii chiri k-zvinoreva Clustering?

Dhiyabhorosi yedhizinesi ne k-means algorithm

I- k-chirevo chekuunganidza zvinyorwa zvegorgorithm ndeyekutsvaga kwedhina uye chigadziro chemichina yekushandiswa chinoshandiswa kumasumbu ekucherechedza muzvikwata zvekucherechedza kwakabatana pasina ruzivo rwepakutanga hwehukama uhwu. Nokuenzanisa, iyo algorithm inoedza kuratidza kuti ndeupi, kana musasa, iyo data ndeye, uye nhamba yemasumbu inotsanangurwa nehuwandu k.

I- k-mean algorithm imwe yezvigadziriswa zvese zvakakosha uye inowanzoshandiswa mukufungidzira kwechiremba, biometrics, uye nemimwe mimwe midzi. Kubudirira kwek- kureva kusanganisa ndechokuti inotaurira nezve data yako (uchishandisa maitiro ayo asina kutarisirwa) pane kuti iwe udzidzise sargorithm pamusoro pe data pakutanga (uchishandisa fomu yakarongedzwa yeargorithm).

Iko dzimwe nguva kunonzi Lloyd's Algorithm, zvikurukuru mumakombiyuta emasayendisiti nokuti chimiro chegoridhedhi chakatanga kurongedzwa naStuart Lloyd muna 1957. Izwi rokuti "k-" rinoumbwa muna 1967 naJames McQueen.

Izvo zvinoreva k-zvinoreva Algorithm Mabasa

I- k-chimiro chegorgorithm ishanduko yekushanduka-shanduka iyo inowana zita rayo kubva pakugadzira kwayo. Iyo sarudzo yegadzirisiti inocherechedza mumapoka e, apo k inowanikwa semuenzaniso wekupinza. Iko zvino inopa maonero ose kumasumbu akavakirwa pamusana pekucherechedza kwekutsvaga kune zvinorehwa nechisumbu. Chikamu chacho chinoreveka ipapo chinodzorerwa uye iyo inotanga zvakare. Heino nzira iyo algorithm inoshanda nayo:

  1. Shanduro yegoridhe inogadzirisa zvinokonzera ma k points seyokutanga ma cluster centers (iyo nzira).
  2. Chinhu chimwe nechimwe mu dataset chinopiwa kumusango wakavharwa, zvichienderana neEuclidean kure pakati peji imwe neimwe uye musango rimwe nerimwe.
  3. Chikamu chimwe nechimwe chinokorodzwa sechiyero chemashoko muboka iroro.
  4. Matanho 2 ne3 anodzokorora kusvikira masumbu acho achinjwa. Kutendeuka kunogona kutsanangurwa nenzira yakasiyana zvichienderana nekushandiswa, asi zvinowanzoreva kuti kana kusaona kuchinja masumbu kana matanho 2 ne3 achidzokidzwa, kana kuti kuchinja hakuiti misiyano yepanyama mukutarisa kwemasumbu.

Kusarudza Nhamba yeCluster

Chimwe chezvinhu zvakanyanya kukanganisa k- zvinoreva kusanganiswa ndeyekuti iwe unofanirwa kujekesa nhamba yemasumbu sechikamu chechigadziriso. Sezvakagadzirwa, shanduro yacho haikwanise kugadzirisa nhamba yakakodzera yemasumbu uye inobva kune vashandi kuti vaone izvi zvisati zvaitika.

Semuenzaniso, kana iwe uine boka revanhu rinofanira kunge rakagadziriswa zvichienderana nehutano hwepabonde sehutano kana murume kana mukadzi, achidana k- nzira yekushandura kwegoridhe achishandisa mupiro k = 3 inogona kumanikidza vanhu mumasumbu matatu kana iviri chete, kana Purogiramu ye k = 2, yaizopa zvakakosha zvakasikwa.

Saizvozvowo, kana boka revanhu rakangoerekana rakanyonganiswa zvichienderana nemamiriro ekumusha uye iwe wakadana k- nzira yekushandura kwegoridhe nemubatsiro k = 20, zvingaguma zvingave zvakare zvakagadzirwa kuti zvibudirire.

Nokuda kwechikonzero ichi, kazhinji chinhu chakanaka chekuedza nemaitiro akasiyana e k kuti aone kukosha kunokodzera zvakakunakira. Iwe unogonawo kufarira kuongorora kushandiswa kwemamwe magwaro ekugadzirisa zvigadzirisheni mumagariro ako ekuziva ruzivo-ruzivo.