Create ML Trouble Loading CSV to Train Word Tagger With Commas in Training Data

Question

Created Dec ’24

Replies 6

Boosts 0

Participants 2

I'm using Numbers to build a spreadsheet that I'm exporting as a CSV. I then import this file into Create ML to train a word tagger model. Everything has been working fine for all the models I've trained so far, but now I'm coming across a use case that has been breaking the import process: commas within the training data. This is a case that none of Apple's examples show.

My project takes Navajo text that has been tokenized by syllables and labels the parts-of-speech.

Case that works...

Raw text:

Naaltsoos yídéeshtah.

Tokens column:

Naal,tsoos, ,yí,déesh,tah,.

Labels column:

NObj,NObj,Space,Verb,Verb,VStem,Punct

Case that breaks...

Raw text:

óola, béésh łigaii, tłʼoh naadą́ą́ʼ, wáin, akʼah, dóó á,shįįh

Tokens column with tokenized text (commas quoted):

óo,la,",", ,béésh, ,łi,gaii,",", ,tłʼoh, ,naa,dą́ą́ʼ,",", ,wáin,",", ,a,kʼah,",", ,dóó, ,á,shįįh

(Create ML reports mismatched columns)

Tokens column with tokenized text (commas escaped):

óo,la,\,, ,béésh, ,łi,gaii,\,, ,tłʼoh, ,naa,dą́ą́ʼ,\,, ,wáin,\,, ,a,kʼah,\,, ,dóó, ,á,shįįh

(Create ML reports mismatched columns)

Tokens column with tokenized text (commas escape-quoted):

óo,la,\",\", ,béésh, ,łi,gaii,\",\", ,tłʼoh, ,naa,dą́ą́ʼ,\",\", ,wáin,\",\", ,a,kʼah,\",\", ,dóó, ,á,shįįh

(record not detected by Create ML)

Tokens column with tokenized text (commas escape-quoted):

óo,la,"","", ,béésh, ,łi,gaii,"","", ,tłʼoh, ,naa,dą́ą́ʼ,"","", ,wáin,"","", ,a,kʼah,"","", ,dóó, ,á,shįįh

(Create ML reports mismatched columns)

Labels column:

NSub,NSub,Punct,Space,NSub,Space,NSub,NSub,Punct,Space,NSub,Space,NSub,NSub,Punct,Space,NSub,Punct,Space,NSub,NSub,Punct,Space,Conj,Space,NSub,NSub

Sample From Spreadsheet

Screenshot 2024-12-22 at 4.43.08 PM.png

Solution Needed

It's simple enough to escape commas within CSV files, but the format needed by Create ML essentially combines entire CSV records into single columns, so I'm ending up needing a CSV record that contains a mixture of commas to use for parsing and ones to use as character literals. That's where this gets complicated.

For this particular use case (which seems like it would frequently arise when training a word tagger model), how should I properly escape a comma literal?

Boost

Answer 1

HullBreach OP

Dec ’24

In the (hopefully) short-term, I am able to export the Numbers spreadsheet as a TSV and have written up a crude converter to generate JSON from it that Create ML can properly handle. However, that adds an extra step that I would hope could be eliminated by directly exporting from Numbers for use in Create ML.

0

Answer 2

Frameworks Engineer OP

Apple

Dec ’24

Can you share the CSV file as exported by Numbers? All commas within a cell need to be quoted or escaped. In your example there are only a few quoted or escaped commas.

0

Answer 3

HullBreach OP

Dec ’24

I have attached the CSV. The lines in question are 290 and 291. Create ML will interpret those as 288 and 289, since it's zero-based and excludes the header row. Importing will work with no problems up to that point and will work with no problems if I delete both rows before exporting to CSV. If you open the CSV in Numbers, everything looks fine.

tokens-labels.csv

Counts,TOKENS,LABELS,English
MATCH,"Háá,góó, ,dí,ní,yá,?","Adv,Adv,Space,Verb,Verb,VStem,Punct",To where are you going?
MATCH,"ʼá,ko","Adv,Adv",then
MATCH,"ash,dla,di, ,mííl","NumC,NumC,NumC,Space,NumC","5,000"
MATCH,"ash,dla,di, ,neez,ná,diin","NumC,NumC,NumC,Space,NumC,NumC,NumC",500
MATCH,"díí, ,tsʼáa,dah","NumC,Space,NumC,NumC",fourteen
MATCH,"dį́įʼ,di, ,mííl","NumC,NumC,Space,NumC",four thousand
MATCH,"dį́įʼ,di, ,mííl,tsoh","NumC,NumC,Space,NumC,NumC",four million
MATCH,"dį́įʼ,di, ,neez,ná,diin","NumC,NumC,Space,NumC,NumC,NumC",four hundred
MATCH,"ha,stą́ą,di, ,mííl","NumC,NumC,NumC,Space,NumC",six thousand
MATCH,"ha,stą́ą,di, ,mííl,tsoh","NumC,NumC,NumC,Space,NumC,NumC",six million
MATCH,"tʼáá,łá,há,dí, ,mííl","NumC,NumC,NumC,NumC,Space,NumC",one thousand
MATCH,"tʼáá,łá,há,dí, ,neez,ná,diin","NumC,NumC,NumC,NumC,Space,NumC,NumC,NumC",one hundred
MATCH,"ash,dla,di, ,mííl,tsoh, ,bée,so","NumC,NumC,NumC,Space,NumC,NumC,Space,NSub,NSub","5,000,000 dollars"
MATCH,"naa,ki, ,tsʼáa,dah, ,bée,so","NumC,NumC,Space,NumC,NumC,Space,NSub,NSub",twelve dollars
MATCH,"naa,ki,di, ,mííl, ,ma,ʼii,dą́ą́ʼ","NumC,NumC,NumC,Space,NumC,Space,NSub,NSub,NSub",two thousand nightshades
MATCH,"ná,há,stʼéi,di, ,neez,ná,diin, ,ní,ma,sii","NumC,NumC,NumC,NumC,Space,NumC,NumC,NumC,Space,NSub,NSub,NSub",nine hundred potatoes
MATCH,"táa,di, ,mííl, ,tłʼí,zí","NumC,NumC,Space,NumC,Space,NSub,NSub",three thousand goats
MATCH,"táa,di, ,mííl,tsoh, ,sǫʼ","NumC,NumC,Space,NumC,NumC,Space,NSub",three million stars
MATCH,"tsee,bíí, ,tsʼáa,dah, ,tsʼaaʼ","NumC,NumC,Space,NumC,NumC,Space,NSub",eighteen baskets
MATCH,"tsee,bíi,di, ,mííl, ,tsís,ʼná","NumC,NumC,NumC,Space,NumC,Space,NSub,NSub",eight thousand bees
MATCH,"tsee,bíi,di, ,neez,ná,diin, ,tsé","NumC,NumC,NumC,Space,NumC,NumC,NumC,Space,NSub",eight hundred stones
MATCH,"tsos,tsʼi,di, ,mííl,tsoh, ,shi,ziiz","NumC,NumC,NumC,Space,NumC,NumC,Space,NPos,NSub",my seven million belts
MATCH,"tsos,tsʼi,di, ,neez,ná,diin, ,waaʼ","NumC,NumC,NumC,Space,NumC,NumC,NumC,Space,NSub",seven hundred spinaches
MATCH,"tʼáá,łá,há,dí, ,mííl,tsoh, ,tsin,dáo","NumC,NumC,NumC,NumC,Space,NumC,NumC,Space,NSub,NSub",one million cents
MATCH,"naa,ki,di, ,mííl,tsoh, ,mą,ʼii, ,dóó, ,naa,ki,di, mííl,tsoh, ,mą,ʼii,tsoh","NumC,NumC,NumC,Space,NumC,NumC,Space,NSub,NSub,Space,Conj,Space,NumC,NumC,NumC,Space,NumC,NumC,Space,NSub,NSub",two million coyotes and two million wolves
MATCH,"táá, ,tsʼáa,dah, ,tłʼoh, ,chin","NumC,Space,NumC,NumC,Space,NSub,Space,NSub",thirteen onions
MATCH,"Naa,ki, ,na,ʼa,hóó,hai, ,bi,tsįʼ, ,yi,shą́,.","NumC,NumC,Space,NObj,NObj,NObj,NObj,Space,NPos,NObj,Space,Verb,VStem,Punct",I am eating two chickens.
MATCH,"ash,dlaʼ, ,bée,so","NumC,NumC,Space,NSub,NSub",5 dollars
MATCH,"ash,dla,ʼáa,dah, ,a,tsá,tsoh","NumC,NumC,NumC,NumC,Space,NSub,NSub,NSub",15 golden eagles
MATCH,"ash,dlaʼ,diin, ,bée,so","NumC,NumC,NumC,Space,NSub,NSub",50 dollars
MATCH,"ha,stą́ą́, ,yáál","NumC,NumC,Space,NSub",6 bits (75 cents)
MATCH,"ha,stą́,diin, ,hash,kʼaan","NumC,NumC,NumC,Space,NSub,NSub",sixty bananas
MATCH,"łaʼ,tsʼáa,dah, ,hoo,ghan","NumC,NumC,NumC,Space,NSub,NSub",eleven homes
MATCH,"naa,diin, ,naa,dą́ą́","NumC,NumC,Space,NSub,NSub",twenty corn
MATCH,"naa,ki, ,a,gaan","NumC,NumC,Space,NSub,NSub",two arms
MATCH,"tá,diin, ,té,lii","NumC,NumC,Space,NSub,NSub",thirty donkeys
MATCH,"tʼáá,łá,ʼí, ,bée,so","NumC,NumC,NumC,Space,NSub,NSub",one dollar
MATCH,"Béé,ga,shii, ,bi,tsįʼ, ,dóó, ,naa,dą́ą́, ,yi,shą́,.","NObj,NObj,NObj,Space,NPos,NObj,Space,Conj,Space,NObj,NObj,Space,Verb,VStem,Punct",I am eating beef and corn.
MATCH,"Béé,ga,shii, ,bi,tsįʼ, ,yi,shą́,.","NObj,NObj,NObj,Space,NPos,NObj,Space,Verb,VStem,Punct",I am eating beef.
MATCH,"Bi,sóo,di, ,bi,tsįʼ, ,yį́,yą́,.","NObj,NObj,NObj,Space,NPos,NObj,Space,Verb,VStem,Punct",He eats pork.
MATCH,"a,beʼ","NSub,NSub",milk
MATCH,"a,chʼoozh,laaʼ","NSub,NSub,NSub",elbow
MATCH,"a,naaʼ","NSub,NSub",war
MATCH,"a,ná,diz","NSub,NSub,NSub",eyelash
MATCH,"a,ná,tsʼiin ","NSub,NSub,NSub",eyebrow
MATCH,"a,ná,tʼéézh","NSub,NSub,NSub",eyebrow
MATCH,"a,ná,zis","NSub,NSub,NSub",eyelid
MATCH,"a,tsįʼ","NSub,NSub",meat
MATCH,"a,yaa,tsʼiin","NSub,NSub,NSub",jaw
MATCH,"a,zeeʼ","NSub,NSub",drug
MATCH,"a,zǫ́ǫ́z","NSub,NSub",stinger
MATCH,"bi,má","NSub,NSub",her mother
MATCH,"dá,ʼá,kʼeh ","NSub,NSub,NSub",cornfield
MATCH,"káá,bin","NSub,NSub",carbon
MATCH,"ké,yah","NSub,NSub",land
MATCH,"kiił,tsoh,ʼííł,ké","NSub,NSub,NSub,NSub",kilobyte
MATCH,"ła,ʼííł,ké","NSub,NSub,NSub",byte
MATCH,"łéé,chąął,kiizh","NSub,NSub,NSub",dalmation
MATCH,"łee,jin","NSub,NSub",coal
MATCH,"łees,dí,sí","NSub,NSub,NSub",kiwi (bird)
MATCH,"łee,tsoh","NSub,NSub",uranium
MATCH,"łį́į́,tsa,ʼii","NSub,NSub,NSub",mare
MATCH,"naa,baa,hii","NSub,NSub,NSub",warrior
MATCH,"naal,ʼee,łí","NSub,NSub,NSub",duck
MATCH,"łéé,chąą,tsa,ʼii, ,bił, ,bi,łéé,chąą,yá,zhí ","NSub,NSub,NSub,NSub,Space,AdpStem,Space,NPos,NAdp,NAdp,NAdp,NAdp",***** (female dog) with her puppy
MATCH,"e,ʼe,ʼaah, ,dóó, ,ha,ʼa,ʼaah","NSub,NSub,NSub,Space,Conj,Space,NSub,NSub,NSub",west and east
MATCH,"gée,so, ,dóó, ,wáán","NSub,NSub,Space,Conj,Space,NSub",cheese and wine
MATCH,"łéé,chąą,ʼí, ,dóó, ,mó,sí","NSub,NSub,NSub,Space,Conj,Space,NSub,NSub",dog and cat
MATCH,"shi,béésh, ,dóó, ,shi,ghéél","NPos,NSub,Space,Conj,Space,NPos,NSub",my knife and my pack
MATCH,"shi,ná,diz, ,dóó, ,ni,ná,diz, ,dóó, ,bi,ná,diz","NPos,NSub,NSub,Space,Conj,Space,NPos,NSub,NSub,Space,Conj,Space,NPos,NSub,NSub",my eyelash and your eyelash and their eyelash
MATCH,"Shi,ma,sa,ní, ,tʼáá,łá,ʼí, ,bée,so, ,bee, ,hó,lǫ́,.","NPos,NSub,NSub,NSub,Space,NumC,NumC,NumC,Space,NObj,NObj,Space,Adp,Space,Verb,VStem,Punct",My maternal grandmother has one dollar.
MATCH,"Yii,tin,.","Verb,VStem,Punct",We freeze.
MATCH,"Daʼ, ,tʼáá,łá,há,dí, ,neez,ná,diin, ,bée,so, ,bee, ,hó,lǫ́,?","Adv,Space,NumC,NumC,NumC,NumC,Space,NumC,NumC,NumC,Space,NObj,NObj,Space,Adp,Space,Verb,VStem,Punct",Does he have one hundred dollars?
MATCH,"Daʼ, ,a,tsi,lí, ,bee, ,hó,lǫ́,?","Adv,Space,NObj,NObj,NObj,Space,Adp,Space,Verb,VStem,Punct",Does he have a younger brother?
MATCH,"dį́į́ʼ, ,yáál","NumC,Space,NSub",4 bits (50 cents)
MATCH,"Tó, ,ná,bish,.","NSub,Space,Verb,VStem,Punct",The water continues boiling.
MATCH,"Tó, ,shi,béézh,.","NSub,Space,Verb,VStem,Punct",The water has boiled.
MATCH,"Shí, ,désh,nish,.","NSub,Space,Verb,VStem,Punct",I started working.
MATCH,"Goh,wééh, ,ná,bish,.","NSub,NSub,Space,Verb,VStem,Punct",Coffee continues boiling.
MATCH,"Shash, ,łi,zhin, ,łóóʼ, ,yį́,yą́,.","NSub,Space,NSub,NSub,Space,NObj,Space,Verb,VStem,Punct",The black bear is eating fish.
MATCH,"Ash,kii, ,yá,zhí, ,bááh, ,yį́,yą́,.","NSub,NSub,Space,NSub,NSub,Space,NObj,Space,Verb,VStem,Punct",The little boy is eating bread.
MATCH,"naa,dą́ą́ʼ, ,bis,gąʼ ","NSub,NSub,Space,NSub,NSub",corn chip
MATCH,"doo,da","AdvNeg,AdvNeg",not
MATCH,"ʼał,dóʼ","Adv,Adv",also
MATCH,"ʼa,yóí","Adv,Adv",exceedingly
MATCH,"yá,ʼá,tʼééh","Inter,Inter,Inter",hello
MATCH,"ash,dla,ʼáa,dah, ,gó,neʼ","NumO,NumO,NumO,NumO,Space,NumO,NumO",fifteenth
MATCH,"neez,náá, ,gó,neʼ","NumO,NumO,Space,NumO,NumO",tenth
MATCH,"naa,ki, ,gó,neʼ, ,Corinthians","NumO,NumO,Space,NumO,NumO,Space,NSub",2 Corinthians
MATCH,"a,hé,heeʼ, ,shi,di,ne,ʼé","Inter,Inter,Inter,Space,NPos,NSub,NSub,NSub",Thank you my people
MATCH,"neez,náá, ,dah, ,woozh","NumC,NumC,Space,NSub,Space,NSub",ten strawberries
MATCH,"Dah, ,woozh, ,dao,są́ʼ,.","NSub,Space,NSub,Space,Verb,VStem,Punct",You are eating a strawberry.
MATCH,"Dah, ,woozh, ,yi,shą́,.","NSub,Space,NSub,Space,Verb,VStem,Punct",I am eating a strawberry.
MATCH,"Hash,tłʼish, ,goh,wééh, ,násh,dlį́į́h,.","NObj,NObj,Space,NObj,NObj,Space,Verb,VStem,Punct",I am drinking hot chocolate.
MATCH,"Hash,tłʼish, ,goh,wééh, ,shéł,beezh,.","NObj,NObj,Space,NObj,NObj,Space,Verb,VStem,Punct",I boiled hot chocolate.
MATCH,"Goh,wééh, ,násh,dlį́į́h,.","NObj,NObj,Space,Verb,VStem,Punct",I am drinking coffee.
MATCH,"Goh,wééh, ,yish,béézh,!","NObj,NObj,Space,Verb,VStem,Punct",The coffee boiled!
MATCH,"Łóóʼ, ,yi,shą́,.","NObj,Space,Verb,VStem,Punct",I am eating fish.
MATCH,"Łóóʼ, ,yí,yą́ą́,.","NObj,Space,Verb,VStem,Punct",I ate fish.
MATCH,"dil,yį́,hí","NObj,NObj,NObj",lead (metal)
MATCH,"ni,chi,dí","NPos,NSub,NSub",your car
MATCH,"ni,hí","Pro,Pro",we
MATCH,"níł,chʼih","NSub,NSub",air
MATCH,"níł,chi","NSub,NSub",air
MATCH,"níł,chʼi,tsoh","NSub,NSub,NSub",December
MATCH,"ní,ló","NSub,NSub",hail
MATCH,"níł,tsą́","NSub,NSub",rain
MATCH,"ni,náá,hai","NPos,NSub,NSub",your age
MATCH,"ní,yol","NSub,NSub",wind
MATCH,"óo,la","NSub,NSub",gold
MATCH,"shi,kʼaʼ","NPos,NSub",my arrow
MATCH,"shí,laʼ","NPos,NSub",my hand
MATCH,"shi,leezh","NPos,NSub",my soil
MATCH,"shi,náá,hai","NPos,NSub,NSub",my age
MATCH,"shi,toʼ","NPos,NSub",my water
MATCH,"shi,yéél","NPos,NSub",my pack
MATCH,"si,tseʼ","NPos,NSub",my stone
MATCH,"si,tsʼaaʼ","NPos,NSub",my basket
MATCH,"si,zǫʼ","NPos,NSub",my star
MATCH,"tsé,kooh","NSub,NSub",canyon
MATCH,"tsé,sǫʼ","NSub,NSub",window
MATCH,"tʼą́ą́,chil","NSub,NSub",April
MATCH,"tʼą́ą́,tsoh","NSub,NSub",May
MATCH,"wóózh,chʼį́į́d","NSub,NSub",March
MATCH,"ya,ʼiish,jáásh,chi,lí","NSub,NSub,NSub,NSub,NSub",June
MATCH,"ya,ʼiish,jáásh,tsoh","NSub,NSub,NSub,NSub",July
MATCH,"yis,nááh","NSub,NSub",prisoner of war
MATCH,"Yił,béézh,.","Verb,VStem,Punct",It is boiling.
MATCH,"Yish,áál,.","Verb,VStem,Punct",I am walking along.
MATCH,"Yá,ʼá,tʼééh, ,shi,zhe,ʼé,!","Inter,Inter,Inter,Space,NPos,NSub,NSub,Punct","Hello, my father!"
MATCH,ndi,Conj,but
MATCH,"díz,diin","NumC,NumC",forty
MATCH,baaʼ,NSub,war
MATCH,ni,Pro,your
MATCH,shí,Pro,I
MATCH,bee,Adp,with
MATCH,bił,Pro,she
MATCH,"daa,bí","Pro,Pro",they
MATCH,"daa,hó","Pro,Pro",they
MATCH,nléí,Pro,one
MATCH,éí,Pro,that
MATCH,shaa,Pro,me
MATCH,"Jii,hó,vah","NSub,NSub,NSub",Jehovah
MATCH,kʼos,NSub,cloud
MATCH,łį́įʼ,NSub,horse
MATCH,sęęs,NSub,wart
MATCH,"Shí,yeʼ, ,naa,ki, ,bée,so, ,dóó, ,díí, ,yáál, ,bee, ,hó,lǫ́,.","NPos,NSub,Space,NumC,NumC,Space,NObj,NObj,Space,Conj,Space,NumC,Space,NObj,Space,Adp,Space,Verb,VStem,Punct",My son has two dollars and fifty cents.
MATCH,"Shi,ma,sa,ní, ,tsos,tsʼid, ,shash, ,łi,zhin, ,bee, ,hó,lǫ́,.","NPos,NSub,NSub,NSub,Space,NumC,NumC,Space,NObj,Space,NObj,NObj,Space,Adp,Space,Verb,VStem,Punct",My maternal grandmother has seven black bears.
MATCH,"Dlǫ́ǫ́ʼ, ,ná,há,stʼéí, ,chʼi, ,łi,chxí,ʼí, ,yį́,yą́,.","NSub,Space,NumC,NumC,NumC,Space,NObj,Space,NObj,NObj,NObj,Space,Verb,VStem,Punct",The prairie dog is eating nine tomatoes.
MATCH,"Na,ʼa,hóó,hai, ,naa,ki, ,na,ʼa,hóó,hai, ,bi,tsįʼ, ,yį́,yą́,.","NSub,NSub,NSub,NSub,Space,NumC,NumC,Space,NObj,NObj,NObj,NObj,Space,NPos,NObj,Space,Verb,VStem,Punct",The chicken is eating two chicken meats.
MATCH,"Béé,ga,shii, ,a,niiʼ, ,bee, ,hó,lǫ́,.","NSub,NSub,NSub,Space,NObj,NObj,Space,Adp,Space,Verb,VStem,Punct",The cow has a face.
MATCH,"Béé,ga,shii, ,a,tsooʼ, ,bee, ,hó,lǫ́,.","NSub,NSub,NSub,Space,NObj,NObj,Space,Adp,Space,Verb,VStem,Punct",The cow has a ******.
MATCH,"Shí, ,té,lii, ,bee, ,hó,lǫ́,.","Pro,Space,NObj,NObj,Space,Conj,Space,Verb,VStem,Punct",I have a donkey.
MATCH,"Shi,má, ,bi,di,bé, ,dóó, ,bi,tłʼí,zí, ,dóó, ,bi,ghan, ,bee, ,hó,lǫ́,.","NPos,NSub,Space,NPos,NObj,NObj,Space,Conj,Space,NPos,NObj,NObj,Space,Conj,Space,NPos,NObj,Space,Adp,Space,Verb,VStem,Punct",My mother has her sheep and her goat and her house.
MATCH,"Shi,má, ,naa,dą́ą́, ,dóó, ,ní,ma,sii, ,yį́,yą́,.","NPos,NSub,Space,NObj,NObj,Space,Conj,Space,NObj,NObj,NObj,Space,Verb,VStem,Punct",My mother is eating corn and a potato.
MATCH,"Shí,yeʼ, ,na,ʼa,hóó,hai, ,bi,tsįʼ, ,dóó, ,ní,ma,sii, ,yį́,yą́,.","NPos,NSub,Space,NObj,NObj,NObj,NObj,Space,NPos,NObj,Space,Conj,Space,NObj,NObj,NObj,Space,Verb,VStem,Punct",My son is eating chicken and potatoes.
MATCH,"Na,ʼa,hóó,hai, ,na,ʼa,hóó,hai, ,bi,tsįʼ, ,yį́,yą́,.","NSub,NSub,NSub,NSub,Space,NObj,NObj,NObj,NObj,Space,NPos,NObj,Space,Verb,VStem,Punct",The chicken is eating chicken meat.
MATCH,"Ma,gí, ,á,lá,tsíín, ,hó,lǫ́,.","NSub,NSub,Space,NObj,NObj,NObj,Space,Verb,VStem,Punct",The monkey has a wrist.
MATCH,"Ma,gí, ,a,nííʼ, ,hó,lǫ́,.","NSub,NSub,Space,NObj,NObj,Space,Verb,VStem,Punct",The monkey has a cheek.
MATCH,"Di,né, ,yá,zhí, ,yááł,tiʼ,.","NSub,NSub,Space,NSub,NSub,Space,Verb,VStem,Punct",The midget talked.
MATCH,"Tsé,kooh, ,Ha,tsoh","NSub,NSub,Space,NSub,NSub",Grand Canyon
MATCH,"na,ʼa,hóó,hai, ,bi,tsįʼ","NSub,NSub,NSub,NSub,Space,NPos,NSub",chicken meat
MATCH,"ha,stiin, ,bi,łį́įʼ","NSub,NSub,Space,NPos,NSub",the manʼs horse
MATCH,"hash,tłʼish, ,goh,wééh","NSub,NSub,Space,NSub,NSub",hot chocolate
MATCH,"chʼil, ,łi,chxí,ʼí","NSub,Space,NSub,NSub,NSub",tomato
MATCH,"chʼil, ,łi,tsxooí","NSub,Space,NSub,NSub",orange
MATCH,"bi,sóo,di, ,bi,tsįʼ","NSub,NSub,NSub,Space,NPos,NSub",pork
MATCH,"béésh, ,óo,la","NSub,Space,NSub,NSub",gold
MATCH,"bée,so, ,yá,zhí","NSub,NSub,Space,NSub,NSub",coin (small money)
MATCH,"béésh, ,ás,zó,lí","NSub,Space,NSub,NSub,NSub",aluminum
MATCH,"béésh, ,í,lį́į,nii ","NSub,Space,NSub,NSub,NSub",precious metal
MATCH,"béésh, ,łi,chí,ʼí ","NSub,Space,NSub,NSub,NSub",copper (or bronze)
MATCH,"béésh, ,łi,gai","NSub,Space,NSub,NSub",silver
MATCH,"béésh, ,łi,tsoii","NSub,Space,NSub,NSub",brass
MATCH,"a,wééʼ, ,yá,zhí","NSub,NSub,Space,NSub,NSub",little baby
MATCH,"ha,zhó,ʼó,go","Adv,Adv,Adv,Adv",carefully
MATCH,"Ha,zhóó,ʼó,go, ,yááł,tiʼ,.","Adv,Adv,Adv,Adv,Space,Verb,VStem,Punct","Slowly, I spoke."
MATCH,"ha,stą́ą,di, ,neez,ná,diin, ,dóó, ,bi,ʼaan, ,ha,stą́ą́, ,chʼééh,ji,yáán","NumC,NumC,NumC,Space,NumC,NumC,NumC,Space,Conj,Space,Conj,Conj,Space,NumC,NumC,Space,NSub,NSub,NSub",six hundred six watermelons
MATCH,"táa,di, ,neez,ná,diin, ,dóó, ,bi,ʼaan, ,naa,ki, ,tłʼoh, ,naa,dą́ą́ʼ","NumC,NumC,Space,NumC,NumC,NumC,Space,Conj,Space,Conj,Conj,Space,NumC,NumC,Space,NSub,Space,NSub,NSub",three hundred two wheat
MATCH,"ná,há,stʼé,diin, ,naal,tsoos, ,bá, ,hoo,ghan","NumC,NumC,NumC,NumC,Space,NSub,NSub,Space,Adp,Space,NAdp,NAdp",ninety books for the house
MATCH,"díz,diin, ,jı̨́, ,dóó, ,díz,diin, ,tłééʼ","NumC,NumC,Space,NSub,Space,Conj,Space,NumC,NumC,Space,NSub",40 days and 40 nights
MATCH,"náz,bas, ,bée,so, ,dóó, ,naa,ki, ,yáál","NumC,NumC,Space,NSub,NSub,Space,Conj,Space,NumC,NumC,Space,NSub",0 dollars and 2 bits (25 cents)
MATCH,"tsos,tsʼi,diin, ,tą,zhii, ,bi,tsįʼ","NumC,NumC,NumC,Space,NSub,NSub,Space,NPos,NSub",seventy turkey meats
MATCH,"Na,ʼa,hóó,hai, ,bi,tsįʼ, ,dóó, ,naa,dą́ą́, ,dóó, ,neez,náá, ,dah, ,woozh, ,yi,shą́,.","NObj,NObj,NObj,NObj,Space,NPos,NObj,Space,Conj,Space,NObj,NObj,Space,Conj,Space,NumC,NumC,Space,NObj,Space,NObj,Space,Verb,VStem,Punct",I am eating chicken meat and corn and ten strawberries.
MATCH,"Haash, ,yi,níl,yé,?","Adv,Space,Verb,Verb,VStem,Punct",What is your name?
MATCH,"Tó, ,yish,béézh,.","NSub,Space,Verb,VStem,Punct",The water boiled.
MATCH,"bííł,tsoh,ʼííł,ké","NumC,NumC,NSub,NSub",zettabyte
MATCH,"bi,nii,naa","NSub,NSub,NSub",warning
MATCH,"cha,ha,ʼoh","NSub,NSub,NSub",shade
MATCH,"di,dzé,tsoh","NSub,NSub,NSub",peach
MATCH,"dį́į́ł,tsoh,ʼííł,ké","NumC,NumC,NSub,NSub",gigabyte
MATCH,"dlaał,tsoh,ʼííł,ké","NumC,NumC,NSub,NSub",terabyte
MATCH,"naal,yé,hé","NSub,NSub,NSub",goods
MATCH,"na,nool,zheeʼ ","NSub,NSub,NSub",thread
MATCH,"na,ʼii,dzeeł","NSub,NSub,NSub",dream
MATCH,"ndí,yí,lii,tsoh","NumC,NumC,NSub,NSub",sunflower
MATCH,"ni,hí,laʼ","NPos,NPos,NSub",our hand
MATCH,"są́ą́ł,tsoh,ʼííł,ké","NumC,NumC,NSub,NSub",petabyte
MATCH,"tááł,tsoh,ʼííł,ké","NumC,NumC,NSub,NSub",megabyte
MATCH,"ʼa,tooʼ","NPos,NPos",stew
MATCH,"Ni,deesh,neeł,.","Verb,Verb,VStem,Punct",I will play.
MATCH,"Ni,deesh,nish,.","Verb,Verb,VStem,Punct",I will work.
MATCH,"Ná,shi,diił,tʼeʼ,.","Verb,Verb,Verb,VStem,Punct",She woke me up.
MATCH,"Yá,ʼá,nísh,tʼééh,.","Verb,Verb,Verb,VStem,Punct",I am well.
MATCH,"Tá,di,deesh,ááł,.","Verb,Verb,Verb,VStem,Punct",I will walk around.
MATCH,"Níínsh,chʼil,.","Verb,VStem,Punct",My eyes are closed
MATCH,"Naa,né,.","Verb,VStem,Punct",He is playing.
MATCH,"Naa,shné,.","Verb,VStem,Punct",I am playing.
MATCH,"Naaʼ,naʼ,.","Verb,VStem,Punct",It crawls about.
MATCH,"Dish,ghaał,.","Verb,VStem,Punct",I am opening my eyes.
MATCH,"Díí,nísh,ʼį́į́ʼ,.","Verb,Verb,VStem,Punct",My eyes are open.
MATCH,"Dí,néesh,daał,.","Verb,Verb,VStem,Punct",I will sit.
MATCH,"Da,nii,dlį́,.","Verb,Verb,VStem,Punct",We are.
MATCH,"Daol,yé,.","Verb,VStem,Punct",They are called.
MATCH,"ʼa,wééʼ, ,bi,di,yé,saʼ,.","NSub,NSub,Space,Verb,Verb,Verb,VStem,Punct",The baby burped.
MATCH,"Di,bé, ,da,di,tléé, ,dóó, ,da,biʼ,nii,dlí,.","NSub,NSub,Space,Verb,Verb,VStem,Space,Conj,Space,Verb,Verb,Verb,VStem,Punct",The sheep are wet and cold.
MATCH,"Shí, ,ʼá,kwíi,ni,sin,.","NSub,Space,Verb,Verb,Verb,VStem,Punct",That is what I think.
MATCH,"Goh,wééh, ,shi,béézh,.","NSub,NSub,Space,Verb,VStem,Punct",The coffee has boiled.
MATCH,"Di,chin, ,ni,shłį́,.","NSub,NSub,Space,Verb,VStem,Punct",I am hunger. (I am very hungry.)
MATCH,"Di,chin, ,ni,sin,.","NSub,NSub,Space,Verb,VStem,Punct",I want hunger. (I am hungry.)
MATCH,"Da,niel, ,á,ní,.","NSub,NSub,Space,Verb,VStem,Punct",Daniel says.
MATCH,"Di,bááʼ, ,ni,shłį́,.","NSub,NSub,Space,Verb,VStem,Punct",I am thirst. (I am very thirsty.)
MATCH,"Di,bááʼ, ,ni,sin,.","NSub,NSub,Space,Verb,VStem,Punct",I want thirst. (I am thirsty.)
MATCH,"A,tʼééd, ,yá,zhí, ,ał,hosh,.","NSub,NSub,Space,NSub,NSub,Space,Verb,VStem,Punct",The little girl sleeps.
MATCH,"jooł, ,ní,maz,go, ,yi,ta,lí, ,dóó, ,jooł, ,yi,ta,lí","NSub,Space,NSub,NSub,NSub,Space,NSub,NSub,NSub,Space,Conj,Space,NSub,Space,NSub,NSub,NSub",soccer ball and football
MATCH,"bi,má, ,ya,zhí, ,są,ní","NPos,NSub,Space,NSub,NSub,Space,NSub,NSub",her great aunt
MATCH,"Naa,ki, ,mó,sí, ,dóó, ,tááʼ, ,łéé,chąą,ʼí, ,na,ʼa,hóó,hai, ,bi,tsįʼ, ,daa,yą́,.","NumC,NumC,Space,NSub,NSub,Space,Conj,Space,NumC,Space,NSub,NSub,NSub,Space,NObj,NObj,NObj,NObj,Space,NPos,NObj,Space,Verb,VStem,Punct",Two cats and three dogs eat chicken meat.
MATCH,"í,lį́į,go, ,naal,yé,hé, ,bee, ,doo,tłizh","NSub,NSub,NSub,Space,NSub,NSub,NSub,Space,Adp,Space,NAdp,NAdp",jewelry with turquoise
MATCH,"Sǫ,ʼtah, ,A,nah","NSub,NSub,Space,NSub,NSub",Star Wars
MATCH,"łéé,chąą,ʼí, ,bi,ghan ","NSub,NSub,NSub,Space,NPos,NSub",dog kennel
MATCH,"Jáan, ,bi,má","NSub,Space,NPos,NSub",Johnʼs mother
MATCH,"di,yin, ,bi,zaad","NSub,NSub,Space,NPos,NSub",Holy Bible
MATCH,"dah, ,woozh","NSub,Space,NSub",strawberry
MATCH,"da,móo, ,yá,zhí","NSub,NSub,Space,NSub,NSub",Saturday
MATCH,"béésh, ,a,deeʼ","NSub,Space,NSub,NSub",spoon
MATCH,"a,zeeʼ, ,íí,łʼí,ní","NSub,NSub,Space,NSub,NSub,NSub",doctor
MATCH,"a,zeeʼ, ,nei,ka,hi","NSub,NSub,Space,NSub,NSub,NSub",nurse
MATCH,"béé,ga,shii, ,bi,tsįʼ","NSub,NSub,NSub,Space,NPos,NSub",beef
MATCH,"Shí, ,ni,chi,dí, ,ʼa,dis,bąąs,.","NSub,Space,NObj,NObj,NObj,Space,Verb,Verb,VStem,Punct",I am starting to drive your car.
MATCH,"Shí, ,ʼa,wééʼ, ,bi,di,yé,saʼ,.","NSub,Space,NObj,NObj,Space,Verb,Verb,Verb,VStem,Punct",I burped the baby.
MATCH,"Shí, ,ʼa,wééʼ, ,bi,yeesh,dloh,.","NSub,Space,NObj,NObj,Space,Verb,Verb,VStem,Punct",I made the baby laugh.
MATCH,"Shí,yeʼ, ,waaʼ, ,yį́,yą́,.","NPos,NSub,Space,NObj,Space,Verb,VStem,Punct",My son is eating spinach.
MATCH,"Shash, ,łóóʼ, ,yį́,yą́,.","NSub,Space,NObj,Space,Verb,VStem,Punct",The bear is eating fish.
MATCH,"Na,ʼa,hóó,hai, ,naa,dą́ą́, ,yį́,yą́,.","NSub,NSub,NSub,NSub,Space,NObj,NObj,Space,Verb,VStem,Punct",The chicken is eating corn.
MATCH,"Ní,ma,sii, ,bił, ,yá,ʼá,tʼééh,.","NObj,NObj,NObj,Space,Pro,Space,Verb,Verb,VStem,Punct",She likes potatoes.
MATCH,"Di,né,tsoh, ,haash, ,wol,yé,?","NObj,NObj,NObj,Space,Adv,Space,Verb,VStem,Punct",What is the big manʼs name?
MATCH,"Gah, ,haash, ,wol,yé,?","NObj,Space,Adv,Space,Verb,VStem,Punct",What is the rabbitʼs name?
MATCH,"Gó,lí,zhii, ,haash, ,wol,yé,?","NObj,NObj,NObj,Space,Adv,Space,Verb,VStem,Punct",What is the skunkʼs name?
MATCH,"A,tʼééd, ,tsí,dii, ,bish,tąsh,.","NSub,NSub,Space,NObj,NObj,Space,Verb,VStem,Punct",The girl was pecked by the bird.
MATCH,"Béé,ga,shii, ,haash, ,wol,yé,?","NObj,NObj,NObj,Space,Adv,Space,Verb,VStem,Punct",What is the cowʼs name?
MATCH,"Ash,kii, ,bááh, ,yį́,yą́,.","NSub,NSub,Space,NObj,Space,Verb,VStem,Punct",The boy is eating bread.
MATCH,"Ash,kii, ,łóóʼ, ,yi,ní,łʼį́,.","NSub,NSub,Space,NObj,Space,Verb,Verb,VStem,Punct",The boy is looking at the fish.
MATCH,"A,tsá, ,tłʼí,zí, ,yį́,yą́,.","NSub,NSub,Space,NObj,NObj,Space,Verb,VStem,Punct",The eagle eats a goat.
MATCH,"Ash,kii, ,a,tʼééd, ,yi,ní,łʼį́,.","NSub,NSub,Space,NObj,NObj,Space,Verb,Verb,VStem,Punct",The boy is looking at the girl.
MATCH,"Shí, ,éí, ,shi,cheii, ,hó,lǫ́,.","NSub,Space,Pro,Space,NPos,NObj,Space,Verb,VStem,Punct",I have a maternal grandfather.
MATCH,"Shí, ,éí, ,shi,ma,sa,ní, ,hó,lǫ́,.","NSub,Space,Pro,Space,NPos,NObj,NObj,NObj,Space,Verb,VStem,Punct",I have a maternal grandmother.
MATCH,"Shi,zhe,ʼé, ,łéé,chąą,ʼí, ,bił, ,yá,ʼá,tʼééh,.","NPos,NSub,NSub,Space,NObj,NObj,NObj,Space,Pro,Space,Verb,Verb,VStem,Punct",My father likes dogs.
MATCH,"Dóo,la, ,naa,dą́ą́, ,dóó, ,ní,ma,sii, ,bił, ,yá,ʼá,tʼééh,.","NSub,NSub,Space,NObj,NObj,Space,Conj,Space,NObj,NObj,NObj,Space,Pro,Space,Verb,Verb,VStem,Punct",The bull likes corn and potatoes.
MATCH,chąąʼ,NSub,*****
MATCH,"Á,daa,ʼá,hál,yą́,.","Verb,Verb,Verb,Verb,VStem,Punct",He/She cares for himself/herself
MATCH,"Á,daa,ʼá,hásh,yą́,.","Verb,Verb,Verb,Verb,VStem,Punct",I care for myself.
MATCH,"A,héé,hí,shííh","NSub,NSub,NSub,NSub",California
MATCH,"shi,cheii","NPos,NSub",my maternal grandfather
MATCH,"tłʼéé,ʼho,naa,ʼéí","NSub,NSub,NSub,NSub",nighttime
MATCH,"tsa,ʼii ","NSub,NSub",female
MATCH,"ndi,kʼąʼ","NSub,NSub",cotton
MATCH,"ndil,kal","NSub,NSub",gourd
MATCH,"ndaal,ʼa,ʼí ","NSub,NSub,NSub",*******
MATCH,"hi,łii,jį́į́ʼ ","NSub,NSub,NSub",dusk
MATCH,"bé,ʼé,zhóóʼ","NSub,NSub,NSub",hairbrush
MATCH,"a,kąʼ","NSub,NSub",male
MATCH,"Naal,tsoos, ,yí,déesh,tah,.","NObj,NObj,Space,Verb,Verb,VStem,Punct",I will read the book.
MATCH,"Naal,tsoos, ,yił,taʼ,.","NObj,NObj,Space,Verb,VStem,Punct",He is reading the book.
MATCH,"Naal,tsoos, ,yí,nísh,taʼ,.","NObj,NObj,Space,Verb,Verb,VStem,Punct",I am reading the book.
MATCH,"Tó, ,násh,dlį́į́h,.","NObj,Space,Verb,VStem,Punct",I drink water.
MATCH,"Bi,sóo,di, ,bi,tsįʼ, ,bił, ,yá,ʼá,tʼééh,.","NObj,NObj,NObj,Space,NPos,NObj,Space,Pro,Space,Verb,Verb,VStem,Punct",He likes pork.
MATCH,"Tó, ,dóó, ,chʼi,yáán, ,tʼáá,gééd,.","NObj,Space,Conj,Space,NObj,NObj,Space,Verb,VStem,Punct",He lacked food and water.
MATCH,"ʼáá,dóó, ,ʼa,wééʼ, ,bi,yeesh,dloh,.","Conj,Conj,Space,NSub,NSub,Space,Verb,Verb,VStem,Punct",And then the baby laughed.
MATCH,"Dóó, ,ho,nóoł,nééł,.","Conj,Space,Verb,Verb,VStem,Punct",And he was winning.
MATCH,"Háá,góó, ,a,yíí,ʼą́,?","Adv,Adv,Space,Verb,Verb,VStem,Punct",To where did they take her?
MATCH,"Háa,dish, ,niʼ,nis,bąąs,?","Adv,Adv,Space,Verb,Verb,VStem,Punct",Where can I park it (the car)?
MATCH,"Háa,dish, ,tí,giʼ, ,nda,ha,niih,?","Adv,Adv,Space,NObj,NObj,Space,Verb,Verb,VStem,Punct",Where can I buy a ticket?
MATCH,"Háa,dish, ,bée,so, ,bá,hoo,ghan,?","Adv,Adv,Space,NObj,NObj,Space,Verb,Verb,VStem,Punct",Where is the bank?
MATCH,"yis,ká̜a̜,go","NSub,NSub,NSub",tomorrow
MATCH,"ʼa,dą́ą́,dą́ą́ʼ","NSub,NSub,NSub",yesterday
MATCH,"Éí, ,yá,ʼá,tʼééh,.","Pro,Space,Verb,Verb,VStem,Punct",It is good.
MATCH,"Yi,béézh,.","Verb,VStem,Punct",It boils.
MATCH,"Yi,dee,sįįł,.","Verb,Verb,VStem,Punct",I will stand.
MISMATCH,"óo,la,"","", ,béésh, ,łi,gaii,"","", ,tłʼoh, ,naa,dą́ą́ʼ,"","", ,wáin,"","", ,a,kʼah,"","", ,dóó, ,á,shįįh","NSub,NSub,Punct,Space,NSub,Space,NSub,NSub,Punct,Space,NSub,Space,NSub,NSub,Punct,Space,NSub,Punct,Space,NSub,NSub,Punct,Space,Conj,Space,NSub,NSub","gold, silver, wheat, wine, oil, and salt"
MISMATCH,"?,.,!,"","","",:,;","Punct,Punct,Punct,Punct,Punct,Punct,Punct",
MATCH,"Níł,jool,.","Verb,VStem,Punct",Give me (non-compact matter)
MATCH,"Ní,tįįh,.","Verb,VStem,Punct","Give me (stiff, slender object)"
MATCH,"Hó,zhǫ́, ,ná,hás,dlį́į́ʼ,.","NSub,NSub,Space,Verb,Verb,Verb,Punct",Beauty has come again.
MATCH,"Éí, ,a,di,ní,díín,.","Pro,Space,Verb,Verb,Verb,VStem,Punct",It is sunny.
MATCH,"Éí, ,na,hał,tin,.","Pro,Space,Verb,Verb,VStem,Punct",It is raining.
MATCH,"Di,bé, ,łi,gai,.","NSub,NSub,Space,Verb,VStem,Punct",The sheep is white.
MATCH,"Bįįh, ,yil,dee,ʼį́, ,łi,chííʼ,.","NSub,Space,NSub,NSub,NSub,Space,Verb,VStem,Punct",The cherry is red.
MATCH,"Di,bé, ,yá,zhí, ,łi,gai,.","NSub,NSub,Space,NSub,NSub,Space,Verb,VStem,Punct",The lamb is white.
MATCH,"Níł,chʼi, ,di,ne,ʼé, ,hó,lǫ́,.","NSub,NSub,Space,NSub,NSub,NSub,Space,Verb,VStem,Punct",Spirits exist.
MATCH,"A,tsooʼ, ,łi,chííʼ,.","NSub,NSub,Space,Verb,VStem,Punct",The ****** is red.
MATCH,"Dził, ,bi,cha,ha,ʼoh, ,ké,yah, ,bi,kʼe,stʼiʼ,.","NSub,Space,NPos,NSub,NSub,NSub,Space,NObj,NObj,Space,Verb,Verb,VStem,Punct",The mountainʼs shadow covers the land.
MATCH,"łéé,chąą,ʼí, ,bi,chʼi,yąʼ","NSub,NSub,NSub,Space,NPos,NSub,NSub",dog food
MATCH,"Shí, ,a,nii,tsį́ʼ, ,hó,lǫ́,.","Pro,Space,NObj,NObj,NObj,Space,Verb,VStem,Punct",I have a cheek.
MATCH,"Shí, ,a,tsooʼ, ,hó,lǫ́,.","Pro,Space,NObj,NObj,Space,Verb,VStem,Punct",I have a ******.
MATCH,"Ná,hoo,kǫs, ,éí, ,a,di,ní,díín,.","NSub,NSub,NSub,Space,Pro,Space,Verb,Verb,Verb,VStem,Punct",The north is sunny.
MATCH,"Ná,hoo,kǫs, ,éí, ,na,hał,tin,.","NSub,NSub,NSub,Space,Pro,Space,Verb,Verb,VStem,Punct",The north is raining.
MATCH,"Shá,di,ʼááhʼ, ,éí, ,a,di,ní,díín,.","NSub,NSub,NSub,Space,Pro,Space,Verb,Verb,Verb,VStem,Punct",The south is sunny.
MATCH,"Shá,di,ʼááhʼ, ,éí, ,na,hał,tin,.","NSub,NSub,NSub,Space,Pro,Space,Verb,Verb,VStem,Punct",The south is raining.
MATCH,"Díí,jį́ʼ, ,éí, ,a,di,ní,díín,.","NSub,NSub,Space,Pro,Space,Verb,Verb,Verb,VStem,Punct",Today is sunny.
MATCH,"Díí,jį́ʼ, ,éí, ,na,hał,tin,.","NSub,NSub,Space,Pro,Space,Verb,Verb,VStem,Punct",Today is raining.
MATCH,"Á,dish,niʼ,.","Verb,Verb,VStem,Punct",I am blinking.
MATCH,"Ał,hosh,.","Verb,VStem,Punct",He/She sleeps.
MATCH,"Bí,hoo,shʼaah,.","Verb,Verb,VStem,Punct",I am learning.
MATCH,"Chʼí,násh,dááh,.","Verb,Verb,VStem,Punct",I go out.
MATCH,"ʼáá,dóó, ,Dee,sháál,.","Conj,Conj,Space,Verb,VStem,Punct",And then I will come.
MATCH,"Désh,nish,.","Verb,VStem,Punct",I started working.
MATCH,"ʼáá,dóó, ,dí,néesh,daał,.","Conj,Conj,Space,Verb,Verb,VStem,Punct",And then I will sit.
MATCH,"Dóó, ,ha,taał,.","Conj,Space,Verb,VStem,Punct",And he/she sings.
MATCH,"Ii,deesh,hosh,.","Verb,Verb,VStem,Punct",I will go to sleep.
MATCH,"Iił,haazh,.","Verb,VStem,Punct",I went to sleep.
MATCH,"Na,ʼiis,niiʼ,.","Verb,Verb,VStem,Punct",He/She shopped.
MATCH,"Naa,shné,.","Verb,VStem,Punct",I am playing.
MATCH,"Naaʼ,naʼ,. ","Verb,VStem,Punct",It crawls about.
MATCH,"Ná,bish,.","Verb,VStem,Punct",It continues boiling.
MATCH,"E,ʼe,ʼaah, ,łi,tso,.","NSub,NSub,NSub,Space,Verb,VStem,Punct",The west is yellow.
MATCH,"E,ʼe,ʼaah, ,łi,tsxo,.","NSub,NSub,NSub,Space,Verb,VStem,Punct",The west is orange.
MATCH,"Ha,ʼa,ʼaah, ,łi,tso,.","NSub,NSub,NSub,Space,Verb,VStem,Punct",The east is yellow.
MATCH,"Ha,ʼa,ʼaah, ,łi,tsxo,.","NSub,NSub,NSub,Space,Verb,VStem,Punct",The east is orange.
MATCH,"Eesh,zhiizh,.","Verb,VStem,Punct",I danced.
MATCH,"bée,so, ,bis,gąʼ","NSub,NSub,Space,NSub,NSub",credit card
MATCH,"Me,sáí,yah","NSub,NSub,NSub",Messiah
MATCH,"God, ,yíí,sí,níł,tsʼą́ą́ʼ,.","NObj,Space,Verb,Verb,Verb,VStem,Punct",Listen to God.
MATCH,"doo, ,yá,ʼá,tʼééh, ,da","AdvNeg,Space,Verb,Verb,VStem,Space,AdvNeg",bad
MATCH,"Doo, ,bi,kʼiʼ,diish,tįįh, ,da,.","AdvNeg,Space,Verb,Verb,Verb,VStem,Space,AdvNeg,Punct",I don't understand.
MATCH,"Doo, ,á,kó,t’ée, ,da,.","AdvNeg,Space,Verb,Verb,VStem,Space,AdvNeg,Punct",It is wrong.
MATCH,"doo, ,háá,jí, ,da","AdvNeg,Space,Verb,VStem,Space,AdvNeg",nowhere
MATCH,"doo, ,na,gháí, ,da","AdvNeg,Space,Verb,VStem,Space,AdvNeg",nobody
MATCH,"bi,chʼi,yąʼ","NPos,NSub,NSub",his/her/its food
MATCH,"mó,sí, ,bi,chʼi,yąʼ","NSub,NSub,Space,NPos,NSub,NSub",cat food
MATCH,"John, ,bi,chʼi,yąʼ, ,éí, ,na,ha,cha,gii,tsoh, ,dóó, ,tsé,sʼná, ,bi,tłʼizh,.","NSub,Space,NPos,NSub,NSub,Space,Pro,Space,NObj,NObj,NObj,NObj,NObj,Space,Conj,Space,NObj,NObj,Space,Verb,VStem,Punct",John's food consisted of locusts and wild honey.

0

Answer 4

Frameworks Engineer OP

Apple

Jan ’25

The CSV file is correct, but I'm having a hard time understanding how the tokens are encoded. The word tagger needs an array of strings. So you need to go from that representation to an array of strings.

For example this line "?,.,!,"","","",:,;", after CSV processing becomes ?,.,!,",",",:,;. I assume it represents ["?", ".", "!", <comma>, <double quote>, ":", ";"], but you'll need to write custom code to make that interpretation. My suggestion is to use a different escape character to make that translation easier, for instance "?,.,!,\,,\"",:,;" and then interpret \ as an escape.

The other option is to use JSON encoding: "[""?"", ""."", ""!"", "","", ""\"""", "":"", "";""]" and then use this to decode:

 var dataFrame = try DataFrame(
    contentsOfCSVFile: url,
    types: ["TOKENS": .data]
)
try dataFrame.decode([String].self, inColumn: "TOKENS", using: JSONDecoder())

0

Answer 5

HullBreach OP

Jan ’25

Here is a more recent case to show what I'm trying to do, as that example with the punctuation was a proof-of-concept for testing. This includes a few commas within the text to be trained. Other examples include quotation marks.

Hodeeyáádą́ą́ʼ,Diyin,God,yótʼááh,hiníláii,índa,nahasdzáán,áyiilaa,.,Nahasdzáán,tʼáadoo,ánoolniní,da,",",índa,tʼáadoo,bikááʼ,siláhí,da,;,bikáaʼgi,tʼáá,átʼéé,nítʼééʼ,chahałheełgo,Diyin,God,biNíłchʼi,Diyinii,tó,yikááʼgóó,nahazleʼ,.,Áádóó,Diyin,God,ádííniid,",",Adinídíin,leʼ,.,Tʼáá,áko,adinídíín,hazlį́į́ʼ,.,Áko,Diyin,God,éí,adinídínígíí,yinééłʼį́įʼgo,bił,yáʼíítʼééh,",",áádóó,adinídínígíí,chahałheeł,yił,ałtsʼáyíínil,.

Adv,Adj,NSub,NObjPos,VPerf,Conj,NObj,VPerf,Punct,NSub,AdvNeg,VProg,PartNeg,Punct,Conj,AdvNeg,Adp,VImpf,PartNeg,Punct,Adp,Adv,VImpf,Adv,Adv,Adj,NSub,NSubPos,NSubPos,NAdp,AdpPos,VImpf,Punct,Conj,Adj,NSub,VPerf,Punct,NObj,VImp,Punct,Adv,Adv,NSub,VPerf,Punct,Adv,Adj,NSub,Pro,NObj,VPerfAdv,ProAdp,VPerf,Punct,Conj,NSub,NAdp,Adp,VPerf,Punct

The core problem is that Create ML does not seem to support several CSV escaping formats that various spreadsheet tools do (including Apple's own Numbers). Additionally, it does not support other file formats directly exported from Numbers that I could find. That makes commas and quotation marks difficult to include in any training data.

I've been able to get around this by writing my own tool that imports TSV files from Numbers and converts them to JSON files that Create ML accepts, adding 2 more steps to the training process each time. However, this post was originally about how to get Create ML to directly accept a Numbers CSV file without added steps every time. If it is not a bug, and Create ML just lacks the functionality, I will continue as I have with my custom work-around, and we can consider this issue resolved.

0

Answer 6

HullBreach OP

Jan ’25

Can I have a verification on whether or not Create ML supports escape characters in CSVs that allow for commas and quotation marks in training data? If it does not, I’ll continue building out my own tool to convert from data exported from Numbers so that I’m not held up in this area.

0

	var dataFrame = try DataFrame(
	contentsOfCSVFile: url,
	types: ["TOKENS": .data]
	)
	try dataFrame.decode([String].self, inColumn: "TOKENS", using: JSONDecoder())