Copy and paste the following link into the Source Path box.

cloud-samples-data/documentai/codelabs/custom/extractor/pdfs

Leave the "Data split" as "Unassigned" for now. Click Import. When the import completes, you should see the Document in the Training page.

Since we are creating a new processor type, we will need to create custom labels to tell Document AI which fields we want to extract. Click on Edit Schema in the bottom-left corner. You should now be in the Schema Management console. Create the following labels using the Create Label button. The Console should look like this when complete. Click on the Back arrow to return to the Training page. Notice that the labels we created show up in the lower-left corner.

Next, we will identify text elements and labels for the entities we would like to extract. These labels will be used to train our model to parse this specific document structure and identify the correct types. Double-click on the document we imported earlier to enter the labeling console. Click on the "Bounding Box" Tool, then highlight the text "1173038" and assign the label CONTROL_NUMBER. You can use the text filter to search for label names. It should look like this once labeled. Complete the same step for the other instance of CONTROL_NUMBER.

NOTE: For this specific document structure, each entity appears twice on the same page. We will need to label each entity every time it appears in the document. This is why we made each label have the Occurrence "Required multiple".

Highlight all instances of the following text values and assign the appropriate labels.

Adams, Chase and Gilbert Inc 972 Gonzalez Dam South Katherine NC 95869-5178

The labeled document should look like this when complete.
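Because every entity has to be labeled each time it appears, it can help to count how many instances of a value are on the page before labeling. The following is only a minimal sketch of that check in plain Python; it is not part of Document AI or its client library. The function names, the sample page text, and the idea of tracking labeled counts in a dictionary are all assumptions made for illustration. Only the value "1173038" and the label name CONTROL_NUMBER come from the steps above.

```python
import re
from typing import Dict, List, Tuple


def find_occurrences(page_text: str, value: str) -> List[Tuple[int, int]]:
    """Return (start, end) character offsets of every exact match of value."""
    return [(m.start(), m.end()) for m in re.finditer(re.escape(value), page_text)]


def unlabeled_instances(
    page_text: str,
    expected_values: Dict[str, str],
    labeled_counts: Dict[str, int],
) -> Dict[str, int]:
    """Map each label to the number of instances that still need labeling.

    expected_values: label name -> text value to look for on the page.
    labeled_counts: label name -> how many instances were labeled so far.
    Labels with nothing left to do are omitted from the result.
    """
    remaining: Dict[str, int] = {}
    for label, value in expected_values.items():
        found = len(find_occurrences(page_text, value))
        done = labeled_counts.get(label, 0)
        if found > done:
            remaining[label] = found - done
    return remaining


# Hypothetical page text: the real sample document is a PDF, and this string
# only imitates the fact that each entity appears twice on the same page.
SAMPLE_PAGE = (
    "Control number 1173038 Employer: Adams, Chase and Gilbert Inc\n"
    "Control number 1173038 Employer: Adams, Chase and Gilbert Inc\n"
)

# CONTROL_NUMBER and its value are taken from the labeling step above.
EXPECTED = {"CONTROL_NUMBER": "1173038"}

if __name__ == "__main__":
    # After labeling only the first instance, one instance is still missing.
    print(unlabeled_instances(SAMPLE_PAGE, EXPECTED, {"CONTROL_NUMBER": 1}))
```

An empty result would mean that every instance found in the text has a matching label, which is what the "Required multiple" occurrence setting expects for this document structure.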