THISNet: Tooth Instance Segmentation on 3D Dental Models via Highlighting Tooth Regions

Abstract— Automatic tooth instance segmentation on 3D dental models is crucial for digitizing dental treatments and enabling computer-assisted treatment planning. However, it is challenging due to the tight arrangement of dental structures and the consequential impact of dental ailments on their morphological characteristics. To address these challenges, we propose a novel method called THISNet. Unlike existing methods, THISNet focuses on highlighting tooth regions rather than relying on bounding box detection, leading to improved accuracy in tooth segmentation and labeling. By incorporating the highlighted tooth regions with a tooth object affinity module, our method effectively integrates global contextual information, considering the relationships between neighboring teeth and their surrounding structures. THISNet adopts an end-to-end learning approach, reducing complexity and enhancing segmentation efficiency compared to multi-stage training methods. Experimental results demonstrate the superiority of THISNet over existing approaches, highlighting its potential in various dental clinical applications.

Index Terms— Tooth segmentation, highlighting, object affinity, 3D dental models.

I. INTRODUCTION

INSTANCE segmentation [1], [2], [3] is one of the most important tasks in computer vision. It detects and delineates the instances appearing in an image and is widely applicable in medical image analysis [4], [5]. In clinical treatment, dentists and oral healthcare professionals routinely use various imaging modalities, including panoramic X-ray radiography, cone-beam computed tomography, and 3D dental models. Automatic tooth instance segmentation on 3D dental models is essential for digitizing dental treatments and enabling computer-assisted treatment planning. Accurately identifying and segmenting individual teeth allows for precise manipulation of teeth in the dental model, resulting in effective treatment strategies. Orthodontic treatments, such as braces and invisible aligners, require detailed tooth information in dental models to develop treatment plans and monitor progress, and dental restoration work, such as crowns, bridges, and dental implants, requires accurate tooth segmentation results. However, compared with general 3D instance segmentation [6], [7], [8], tooth instance segmentation faces several specific challenges. Teeth are tightly arranged, making it difficult to determine the boundaries of each tooth. Additionally, tooth wear, dental caries, and other oral diseases can significantly alter the shape of teeth, further increasing the complexity of classification and segmentation tasks. These challenges emphasize the intricate nature of tooth instance segmentation, thereby highlighting the necessity of designing powerful algorithms.
A variety of methods have been proposed for tooth instance segmentation [9], [10], [11], [12], [13]. These methods can be divided into two broad categories: grouping-based methods and detection-based methods. Fig. 1 shows the differences between these two kinds of methods. Grouping-based methods [12], [14] group points in 3D dental models using semantic segmentation and clustering techniques. Each group is assigned a unique identifier to enable the segmentation and labeling of different tooth instances. These methods typically group points in 3D dental models based on their positions and semantic similarities. For densely arranged or tightly contacting teeth, grouping-based methods may struggle to accurately separate neighboring instances due to their similar semantics, resulting in inaccurate segmentation outcomes.

In contrast, detection-based methods [9], [10], [11] offer greater intuitiveness. They first utilize an object detection network to identify distinct tooth proposals in the 3D dental model. Subsequently, the points within the detected bounding boxes are directly segmented to achieve instance-level segmentation and labeling for each tooth instance. Nevertheless,
with abnormal tooth arrangements. HDWE [29] developed a novel hierarchical deep word embedding for fine-grained recognition. MeshSegNet [27] presented a tooth segmentation method that leveraged both points and triangle mesh cells to learn multi-scale contextual and global features from raw dental models. TSGCNet [28] introduced a two-stream graph convolutional network that utilized coordinate and normal vector properties to learn multi-view geometric information and extract more effective representations for tooth segmentation. STSNet [30] presented the first attempt at unsupervised pre-training for 3D tooth segmentation. However, semantic-based methods struggle to accurately delineate individual tooth boundaries when teeth are in close proximity, which leads to neighboring teeth being incorrectly classified. In contrast, the proposed tooth object affinity-based method effectively highlights the areas corresponding to individual teeth, enabling accurate tooth segmentation and identification.

3) Instance-Based Methods: Mask-MCNet [9] achieved localization and segmentation of each tooth instance by first predicting 3D bounding boxes and subsequently segmenting the points belonging to each tooth instance. A method based on proposal generation and cluster grouping was proposed for point cloud-based tooth instance segmentation [12], which employs an attention mechanism that combines objectness and point-wise knowledge. It is effective in scenarios where teeth are undamaged or minimally worn, and where tooth boundaries are well-defined and without blurring. TSegNet [10] proposed robust tooth centroid prediction and accurate single-tooth segmentation on point cloud data. MLMSM [31] proposed a novel semi-supervised framework to train two self-supervised models and then employed a model ensemble approach to address the problem of limited data in downstream tasks. DArch [11] then introduced a semi-supervised framework that included a two-stage process for tooth centroid detection and instance segmentation. In this paper, we utilize tooth object affinity maps to directly highlight the regions corresponding to each tooth and segment them in an end-to-end fashion. By consolidating contextual information from these highlighted regions, we can obtain accurate tooth instance segmentation results.

B. Correspondence Matching

Click prediction [32] was proposed to address the text-based image search problem through multimodal hypergraph learning-based sparse coding. Bipartite matching [33] was employed to find correspondences between objects in an image and assign them to the same class or category. It is widely used in object detection and segmentation to detect and segment objects of interest in an image or a video [34], [35]. SparseInst [36] achieved real-time image segmentation by using sparse representations to highlight important regions in 2D images, and by utilizing bipartite matching to assign the correct class labels to target objects. In this paper, we explore an object-aware optimal transport assignment (OOTA) strategy using bipartite matching [33], [36] in three-dimensional (i.e., 3D mesh) space to pair the predicted tooth regions and the ground truth masks in the training stage.

C. Instance Segmentation on Point Cloud Data

3D instance segmentation methods on point cloud data can be roughly categorized into top-down and bottom-up methods. Top-down methods first identified potential object candidates using object detection and then performed instance segmentation on local regions of the 3D point cloud data. GSPN [37] proposed to employ the analysis-by-synthesis method to generate object proposals from noisy scenes. 3D-BoNet [38] used a top-down strategy to predict a fixed set of 3D proposals based on bounding box detection. 3D-SIS [39] generated boxes and segmentation masks by employing both geometric and color features. Bottom-up methods employed a grouping strategy after a discriminative feature embedding to perform segmentation. A point-to-surface representation [40] was employed to assemble local and global geometric information for 3D point cloud data. SGPN [41] employed a similarity matrix to cluster similar points into instances. VLAAD [42] enhanced the discriminative power of feature representations by adaptively assigning weights to each residue vector lying between the descriptors and the centroid of the cluster to which they belong. PointGroup [43] predicted the offset of each point relative to the center point of the instance, and fused the semantic segmentation results and offsets to generate cluster proposals. SoftGroup [44] allows each point to be associated with multiple classes to mitigate the problems stemming from semantic prediction errors. Overall, these segmentation methods, designed for large-scale scenes, may have difficulty adapting to the particularity of tightly arranged teeth.

III. METHOD

A. Overview

Fig. 2 shows the proposed network. Specifically, we first utilize a two-stream multi-scale feature encoder to extract discriminative coordinate and normal features from the 3D dental models. Then, a tooth object affinity module is employed to highlight the tooth regions of interest. Finally, in the identification head, a segmentation module leverages the highlighted regions to perform accurate tooth mask segmentation, and a labeling module performs classification and objectness identification to assign labels to each segmented tooth mask. By combining these components, our method achieves precise tooth instance segmentation and labeling in 3D dental models.

B. Two-Stream Multi-Scale Feature Embedding

Since tooth objects are tightly arranged and vary in scale and orientation, inspired by TSGCNet [28], we employ a two-stream multi-scale feature encoder to extract tooth features from a given 3D dental model. It is noted that the feature encoder is flexible and can also be replaced by existing point cloud feature encoders (e.g., PointNet [45], PointNet++ [46], and DGCNN [47]).

The input of the proposed network is a matrix of size $M \times 24$, where $M$ indicates the number of mesh cells; each mesh cell contains four 3D points (i.e., the three vertices and the centroid) and four corresponding normal vectors in 3D dental models.
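For concreteness, the following minimal NumPy sketch shows one way such an $M \times 24$ input could be assembled from a triangle mesh given as vertex and face arrays. The function name is illustrative, and repeating the face normal as a stand-in for the four per-cell normal vectors is an assumption, since the excerpt does not specify how the normals are derived.

```python
import numpy as np


def build_cell_features(vertices, faces):
    """Assemble the M x 24 input described above: for each mesh cell,
    four 3D points (three vertices plus the centroid) and four
    corresponding unit normal vectors, flattened to 24 values."""
    tri = vertices[faces]                       # (M, 3, 3) triangle vertices
    centroid = tri.mean(axis=1, keepdims=True)  # (M, 1, 3) cell centroids

    # Face normal from the cross product of two edges, normalized.
    n = np.cross(tri[:, 1] - tri[:, 0], tri[:, 2] - tri[:, 0])
    n /= np.linalg.norm(n, axis=1, keepdims=True) + 1e-12

    points = np.concatenate([tri, centroid], axis=1)   # (M, 4, 3)
    normals = np.repeat(n[:, None, :], 4, axis=1)      # (M, 4, 3), assumed
    return np.concatenate([points, normals], axis=1).reshape(len(faces), 24)
```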
Fig. 2. The proposed network architecture contains three key parts: (1) a two-stream multi-scale feature encoder that generates basic tooth feature representations to handle tightly arranged teeth with varying shapes; (2) an object affinity-based decoder that contains a mask branch to generate tooth mask features and an instance branch that uses the tooth object affinity module to highlight potential tooth regions; (3) an identification head designed to segment and classify each tooth instance using the object-aware optimal transport assignment (OOTA) strategy.
To handle variations in orientation across different 3D dental models, we use a feature transform module (FTM) to obtain a transformation matrix that normalizes the input dental model in both the coordinate and normal vector streams. The FTM comprises three consecutive 1D convolutional layers, each followed by batch normalization and a ReLU activation. The dimensions of these layers are 64, 128, and 1024, respectively. Then, max pooling is applied for feature aggregation, and three linear layers are used for dimensionality reduction. The output dimensions of the first and second linear layers are 512 and 256, respectively. The output of the last linear layer takes the form of a transformation matrix of size $C \times C$, where $C$ matches the dimension of the input coordinates and normal vectors, set to 12. This learned transformation matrix is applied to the input coordinates and normal vectors for normalization through inner product operations.
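A PyTorch sketch of an FTM consistent with this description is given below. The layer widths come from the text; the ReLU placement inside the linear stack and the identity-biased initialization of the transform (a common T-Net convention from PointNet [45]) are assumptions.

```python
import torch
import torch.nn as nn


class FTM(nn.Module):
    """Feature transform module sketch: Conv1d(64-128-1024) with BN+ReLU,
    max pooling, linear layers (512, 256), and a final linear layer that
    emits a C x C matrix (C = 12) applied to the input by inner product."""

    def __init__(self, c: int = 12):
        super().__init__()
        self.c = c
        self.convs = nn.Sequential(
            nn.Conv1d(c, 64, 1), nn.BatchNorm1d(64), nn.ReLU(),
            nn.Conv1d(64, 128, 1), nn.BatchNorm1d(128), nn.ReLU(),
            nn.Conv1d(128, 1024, 1), nn.BatchNorm1d(1024), nn.ReLU(),
        )
        self.fc = nn.Sequential(
            nn.Linear(1024, 512), nn.ReLU(),
            nn.Linear(512, 256), nn.ReLU(),
            nn.Linear(256, c * c),
        )

    def forward(self, x):                       # x: (B, C=12, M), one stream
        t = self.convs(x).max(dim=-1).values    # (B, 1024) global feature
        t = self.fc(t).view(-1, self.c, self.c)
        # Bias toward the identity so training starts near "no transform"
        # (assumed convention, not stated in the text).
        t = t + torch.eye(self.c, device=x.device)
        return torch.bmm(t, x)                  # normalized stream features
```

Each of the coordinate and normal vector streams would use its own FTM on the 12 values per cell belonging to that stream.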
The multi-scale feature embedding module is designed to obtain tooth feature representations at different scales. To learn local feature representations, we build K-nearest neighbor (KNN) graphs at multiple scales for each mesh cell. Since nodes on the edges of the mesh can be biased towards neighboring meshes, which results in discontinuities and inhomogeneities, we take the mesh cell centroids as the KNN nodes to keep the distribution even. The KNN graphs identify the neighboring nodes closest to each cell, which is essential for capturing local structure information. To construct the KNN graph for each mesh cell in the 3D dental models, we compute the Euclidean distance between each cell node and its $K$ nearest neighbors, resulting in a graph $G(V, E)$, where $V$ represents the cell node attribute sets (i.e., coordinates or normal vectors), denoted as $V = \{n_1, n_2, \ldots, n_M\}$, and $E \subseteq \{(x, y) \mid (x, y) \in V^2 \wedge x \neq y\}$ represents the edges of the graph. For each node $n_i \in V$, the KNN set is denoted as $\mathcal{K}$. The updated coordinate and normal features at the $l$-th scale, $\hat{f}_c^l$ and $\hat{f}_n^l$, can be respectively computed by combining the input feature vectors with the contextual neighbor features:

$$\hat{f}_{c_i}^{l} = f_{c_i}^{l} \oplus f_{c_{ij}}^{l}, \quad \forall c_{ij} \in \mathcal{K}, \qquad \hat{f}_{n_i}^{l} = f_{n_i}^{l} \oplus f_{n_{ij}}^{l}, \quad \forall n_{ij} \in \mathcal{K}, \tag{1}$$

where $c_i$ and $n_i$ indicate the coordinate and normal vectors of the $i$-th node, $c_{ij}$ and $n_{ij}$ represent one of the $K$ node coordinates and normal vectors adjacent to the $i$-th node, and $\oplus$ denotes concatenation.

To capture the basic topology of the tooth, we utilize a graph attention mechanism and max-pooling to focus on the coordinate and normal vector attributes based on the updated cell node features. Specifically, in the coordinate stream, the learnable connection attention weight $a_{ij}^{l}$ for each coordinate can be calculated using the neighbor features $f_{c_{ij}}$ by

$$a_{ij}^{l} = \mathrm{Conv}(\Delta f_{c_{ij}}^{l} \oplus f_{c_{ij}}^{l}), \quad \forall c_{ij} \in \mathcal{K}, \tag{2}$$

where $\Delta f_{c_{ij}}^{l} = f_{c_i}^{l} - f_{c_{ij}}^{l}$ measures the difference between coordinate $i$ and one of its $K$ nearest neighbors $j$. The final output coordinate node features can then be calculated by

$$F_c^{l+1} = \sum_{c_{ij} \in \mathcal{K}} a_{ij}^{l} \odot \hat{f}_{c_i}^{l}, \tag{3}$$

where $\odot$ denotes the element-wise product.

In the normal vector stream, the normal node features are aggregated using max-pooling:

$$F_n^{l+1} = \mathrm{maxpooling}(f_{n_{ij}}^{l}), \quad \forall n_{ij} \in \mathcal{K}. \tag{4}$$

The coordinate node features and normal node features of different scales are fused by an MLP layer:

$$F_c = \mathrm{MLP}(F_c^{1} \oplus F_c^{2} \oplus F_c^{3}), \qquad F_n = \mathrm{MLP}(F_n^{1} \oplus F_n^{2} \oplus F_n^{3}). \tag{5}$$

The final tooth feature embeddings of the coordinate and normal streams are concatenated, passed through another MLP layer, and fed into the two-branch decoder based on tooth object affinity:

$$F = \mathrm{MLP}(F_c \oplus F_n), \quad F \in \mathbb{R}^{D \times M}, \tag{6}$$

where $D$ is the dimension of the tooth feature embedding.
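A compact PyTorch sketch of one scale of this embedding is given below, assuming per-model (unbatched) tensors. The sigmoid on the attention weights and the way the lifted neighbor features enter the weighted sum of Eq. (3) are implementation assumptions that the text does not pin down.

```python
import torch
import torch.nn as nn


def knn_graph(pos, k):
    """Indices of the k nearest neighbors of each cell centroid."""
    dist = torch.cdist(pos, pos)                            # (M, M) distances
    return dist.topk(k + 1, largest=False).indices[:, 1:]   # drop self


class CoordGraphLayer(nn.Module):
    """One scale of the coordinate stream (Eqs. 1-3): concatenate
    neighbor features, score each edge, and sum weighted features."""

    def __init__(self, d_in, d_out):
        super().__init__()
        self.edge = nn.Linear(2 * d_in, d_out)  # plays the role of Conv in Eq. 2
        self.node = nn.Linear(2 * d_in, d_out)  # lifts f-hat (Eq. 1) to d_out

    def forward(self, f, idx):                  # f: (M, D), idx: (M, K)
        fj = f[idx]                             # (M, K, D) neighbor features
        fi = f.unsqueeze(1).expand_as(fj)       # (M, K, D) repeated centers
        a = torch.sigmoid(self.edge(torch.cat([fi - fj, fj], -1)))  # Eq. 2
        h = self.node(torch.cat([fi, fj], -1))                      # Eq. 1
        return (a * h).sum(dim=1)               # Eq. 3: sum of a_ij * f-hat


def normal_stream_layer(f, idx):
    """Normal-vector stream at one scale (Eq. 4): max-pool over neighbors."""
    return f[idx].max(dim=1).values
```

Running this layer three times with different `k` and concatenating the outputs before an MLP would realize the multi-scale fusion of Eqs. (5)-(6).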
C. Tooth Object Affinity Module

To capture better tooth boundaries and contours and to leverage global contextual information, we design the object affinity-based decoder to directly highlight potential tooth regions, as shown in Fig. 2. It consists of two branches: a mask branch and an instance branch. The mask branch is responsible for generating tooth instance mask features, which distinguish teeth from the surrounding structures. To collect tooth mask features, we utilize a vanilla $1 \times 1$ convolutional layer in the mask branch.

In the instance branch, the tooth object affinity module is designed to generate $N$ tooth instance activation maps for highlighting individual tooth areas. These tooth object affinity maps are instance-level weighted maps that highlight each tooth region, automatically capturing the semantic and instance information from the final tooth feature embedding, thereby facilitating the identification and segmentation of each tooth.

Specifically, the tooth object affinity module consists of a vanilla $1 \times 1$ convolutional layer $\mathrm{Conv}(\cdot)$, a non-linear Sigmoid activation function $\sigma(\cdot)$, and a layer normalization $LN[\cdot]$. The tooth object affinity maps $F_{oa}$ are generated from the tooth feature embedding $F \in \mathbb{R}^{D \times M}$ obtained from the two-stream multi-scale feature encoder:

$$F_{oa} = LN[\sigma(\mathrm{Conv}(F))], \quad F_{oa} \in \mathbb{R}^{N \times M}, \tag{7}$$

where $N$ represents the number of tooth object affinity maps, and $M$ denotes the number of input mesh cells. $F_{oa}$ represents a sparse set of $N$ tooth instance activation maps, which are instance-aware weighted maps designed to emphasize areas of information specific to each tooth object. The tooth instance features $Z$ are obtained by aggregating rich contextual information using the tooth object affinity maps $F_{oa}$ and the tooth feature embedding $F$:

$$Z = F_{oa} \cdot F^{T}, \quad Z \in \mathbb{R}^{N \times D}, \tag{8}$$

where $z = (z_i)_N$ is the instance feature representation of the $N$ potential tooth objects in the dental model. This aggregation process combines the local tooth features with the highlighted regions to capture both local and global information.

To further improve the generalization capability and mitigate the impact of redundant information, similar to the method in [36], we reshape the tooth instance features $Z$ into multiple groups and introduce a group convolution operation:

$$\hat{Z} = \mathcal{G}(\{Z_1, Z_2, \ldots, Z_G\}; \Theta), \tag{9}$$

where $\mathcal{G}(\cdot)$ represents the group convolution operation with shared parameters $\Theta$, and $1$ to $G$ index the subsets of channels divided among the $G$ groups. By reshaping the tooth instance features into groups and applying group convolution, we can effectively extract and preserve the key features related to tooth instances.
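The module of Eqs. (7)-(9) can be sketched in PyTorch as follows. The axis over which layer normalization is applied and the number of convolution groups are assumptions not fixed by the text.

```python
import torch
import torch.nn as nn
import torch.nn.functional as Fn


class ToothObjectAffinity(nn.Module):
    """Sketch of Eqs. (7)-(9): a 1x1 conv + sigmoid + layer norm yield N
    affinity maps over the M cells; instance features are gathered by
    F_oa . F^T (Eq. 8) and refined by a grouped 1x1 convolution (Eq. 9)."""

    def __init__(self, d, n, groups=4):          # groups must divide d
        super().__init__()
        self.proj = nn.Conv1d(d, n, 1)           # D -> N affinity logits
        self.group = nn.Conv1d(d, d, 1, groups=groups)

    def forward(self, feat):                     # feat F: (B, D, M)
        oa = torch.sigmoid(self.proj(feat))      # (B, N, M), Eq. (7)
        oa = Fn.layer_norm(oa, oa.shape[-1:])    # LN over cells (assumed axis)
        z = torch.bmm(oa, feat.transpose(1, 2))  # (B, N, D), Eq. (8)
        z = self.group(z.transpose(1, 2)).transpose(1, 2)  # Eq. (9) refine
        return oa, z                             # affinity maps, instance feats
```

Given the projected mask features introduced in the identification head below, the instance masks then reduce to a batched matrix product of the refined instance features with those mask features.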
D. Tooth Identification Head

In the identification head, the final $N$ tooth instance masks $\mathcal{M}$ can be obtained by element-wise multiplication of the updated tooth instance features $\hat{Z}$ with the mask features $F_m$:

$$\mathcal{M} = \hat{Z} \cdot F_m, \quad \mathcal{M} \in \mathbb{R}^{N \times M}. \tag{10}$$

The mask features $F_m$ are obtained by applying a $1 \times 1$ projection convolutional layer with $D$ channels to the tooth feature embedding $F$ in the mask branch of the decoder:

$$F_m = \mathrm{Conv}(F), \quad F_m \in \mathbb{R}^{D \times M}. \tag{11}$$

Finally, two additional $1 \times 1$ convolutional layers are applied to the grouped tooth instance features $\hat{Z}$ for tooth classification and objectness confidence prediction, respectively.

By multiplying the updated tooth instance features with the mask features, our identification head generates the final tooth instance masks. The mask features obtained through the projection convolutional layer refine and align the tooth instance masks with the underlying tooth structures. Furthermore, the tooth instance features are further processed for tooth classification and objectness confidence prediction, enabling comprehensive tooth labeling and enhancing the overall segmentation performance.

E. Objectness-Based Loss Functions

As the network generates a fixed number of $N$ tooth instances, we utilize an end-to-end training approach with bipartite matching to establish a one-to-one correspondence between the predicted tooth objects and the ground truth teeth [35], [48]. Given the imbalance between teeth and the background in the 3D dental model, the network is prone to predicting teeth as background. To mitigate this issue, we propose an object-aware optimal transport assignment (OOTA) strategy based on the Dice similarity coefficient (DSC) and tooth classification confidence. Building on SparseInst [36], our OOTA strategy aims to achieve precise one-to-one matching between the predicted tooth instances and the corresponding ground truth teeth. To achieve this, we introduce a pairwise matching score $S(i, j)$ to assess the similarity between the $i$-th tooth instance prediction and the $j$-th ground truth tooth. The matching score is calculated as

$$S(i, j) = \mathrm{DSC}(m_i, g_j)^{\alpha} \cdot p_{i, c_j}^{1-\alpha}, \tag{12}$$

where $m_i$ and $g_j$ represent the masks of the $i$-th predicted tooth instance and the $j$-th ground truth tooth, respectively. The probability $p_{i, c_j}$ denotes the likelihood of the $i$-th prediction belonging to category $c_j$. The hyper-parameter $\alpha$ controls the balance between the impact of segmentation and classification and is empirically set to 0.8 [36]. In addition, the Dice similarity coefficient is calculated as

$$\mathrm{DSC}(m, g) = \frac{2 \sum_{x,y,z} m_{xyz} \cdot g_{xyz}}{\sum_{x,y,z} m_{xyz}^{2} + \sum_{x,y,z} g_{xyz}^{2}}, \tag{13}$$

where the mesh cells at position $(x, y, z)$ in the prediction mask $m$ and ground truth mask $g$ are represented as $m_{xyz}$ and $g_{xyz}$, respectively.

The OOTA strategy leverages the Hungarian algorithm [33] to achieve optimal matching between the ground truth objects and the $N$ predictions. The complete OOTA strategy is described in Algorithm 1. Finally, the proposed training loss function is computed between the tooth instance predictions that have been matched
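Since Algorithm 1 and the remainder of the loss definition fall outside this excerpt, the following NumPy/SciPy sketch illustrates only the matching step of Eqs. (12)-(13): scoring every prediction/ground-truth pair and solving the one-to-one assignment with the Hungarian algorithm [33]. All names are illustrative.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def dice(m, g, eps=1e-6):
    """Soft Dice between a predicted mask m and a ground-truth mask g,
    both flattened over the M mesh cells (Eq. 13)."""
    return 2.0 * (m * g).sum() / ((m ** 2).sum() + (g ** 2).sum() + eps)


def oota_match(pred_masks, pred_probs, gt_masks, gt_labels, alpha=0.8):
    """One-to-one assignment of N predictions to T ground-truth teeth by
    maximizing S(i, j) = DSC(m_i, g_j)^alpha * p_{i,c_j}^(1 - alpha)
    (Eq. 12) with the Hungarian algorithm; unmatched predictions are
    treated as background."""
    n, t = len(pred_masks), len(gt_masks)
    score = np.zeros((n, t))
    for i in range(n):
        for j in range(t):
            score[i, j] = (dice(pred_masks[i], gt_masks[j]) ** alpha
                           * pred_probs[i, gt_labels[j]] ** (1 - alpha))
    # Hungarian solves a minimization problem, so negate the scores.
    rows, cols = linear_sum_assignment(-score)
    return list(zip(rows, cols))                 # matched (prediction, gt) pairs
```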
TABLE I
COMPARISON ON THREE-FOLD CROSS-VALIDATION WITH STATE-OF-THE-ART METHODS
where $k$ is the number of tooth object categories, $p_{ij}$ denotes the number of cells of class $i$ predicted as class $j$, and $p_{ji}$ represents the false-positive cells predicted as class $j$ while the true category is $i$.
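The mIOU formula itself falls on a page not reproduced here; the sketch below computes per-class IoU from a $k \times k$ confusion matrix using the standard definition consistent with the notation above, $\mathrm{IoU}_i = p_{ii} / (\sum_j p_{ij} + \sum_j p_{ji} - p_{ii})$.

```python
import numpy as np


def miou(conf):
    """Mean IoU from a k x k confusion matrix whose entry p_ij counts
    cells of true class i predicted as class j."""
    conf = conf.astype(np.float64)
    tp = np.diag(conf)                                  # p_ii per class
    denom = conf.sum(axis=1) + conf.sum(axis=0) - tp    # union per class
    return np.nanmean(tp / np.where(denom > 0, denom, np.nan))
```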
B. Comparison With State-of-the-Art Methods

1) Competing Methods: The performance of the proposed method is compared with eight state-of-the-art methods on the benchmark 3D oral scan dataset Teeth3DS, and the OA and mIOU metrics are reported. The compared methods include:
• PointNet [45]: A pioneering network for classification and segmentation that directly operates on unordered 3D point cloud data.
• PointNet++ [46]: This method extends PointNet by employing set abstraction and grouping to learn local contextual information for 3D point cloud analysis.
• DGCNN [47]: This method learns semantic information from point cloud data by constructing local adjacency matrices and updating the graph structure dynamically across layers.
• MeshSegNet [27], [54]: A deep neural network that operates at multiple scales to learn high-level geometric features for end-to-end tooth segmentation on 3D dental models.
• TSGCNet [28], [53]: This method segments 3D dental models by adopting two graph-learning streams to extract more discriminative geometric representations from coordinates and normal vectors.
• SGPN [41]: A pioneering work in 3D instance segmentation from point clouds, using a single network to generate proposals and assign a semantic class to each object.
• PointGroup [43]: A centroid-based 3D instance segmentation method that identifies and labels individual objects in 3D point cloud data.
• 3D-BoNet [38]: A bounding box detection-based 3D instance segmentation method widely used in tooth instance segmentation and 3D point cloud segmentation.

2) Quantitative Results: The quantitative comparison results of three-fold cross-validation against state-of-the-art methods are listed in Table I. The evaluation metrics include OA, mIOU, and the IOU for each tooth category and the background. Overall, the proposed method demonstrates superior performance compared to existing methods based on semantic and instance segmentation. It outperforms PointNet++ [46] in OA by 4.49% and in mIOU by 11.41%, indicating its effectiveness in 3D tooth model segmentation tasks. This improvement can be attributed to the two-stream multi-scale feature encoder, which captures normal vectors and contextual information and effectively fuses multi-scale features, mitigating the semantic confusion caused by the similarity of tooth locations and morphologies. Compared to DGCNN [47], the proposed method achieves improvements of 1.08% in OA and 4.37% in mIOU. The separate learning of spatial coordinates and normal vectors contributes to extracting more discriminative features and enhancing the accuracy of tooth segmentation in dental models. In comparison with MeshSegNet [27], [54], the proposed method outperforms it by 0.61% in OA and 2.84% in mIOU. By constructing dynamic K-nearest neighbor (KNN) graphs, we flexibly learn global tooth features while preserving local information. Compared to TSGCNet [28], [53], the proposed method achieves an improvement of 1.59% in mIOU. This highlights the effectiveness of our tooth object affinity module and identification head in segmenting each complete tooth instance region. In comparison with the grouping-based methods [41], [43], our method outperforms PointGroup [43] by 1.44% in OA and 4.86% in mIOU. We attribute this to the difficulty grouping-based methods have in obtaining accurate semantics for adjacent teeth with similar shapes, which leads to inaccurate grouping results. On the contrary, our method models the interaction between adjacent teeth by using global contextual information in the tooth object affinity module, which is conducive to capturing the discriminative features of tooth shapes, thereby improving segmentation accuracy. Compared to the detection-based method 3D-BoNet [38], the proposed method achieves improvements of 1.7% in OA and 6.99% in mIOU. This may be attributed to the fact that detection-based methods segment objects directly within the detected bounding boxes; since teeth are tightly arranged, these boxes may cover portions of adjacent teeth, which leads to multiple teeth being segmented as a single entity. Instead, we use the tooth object affinity module to locate each tooth region, and the highlighted irregular areas conform better to the actual shapes of the teeth, hence improving segmentation accuracy.

To assess the quality (i.e., completeness) of the tooth instance segmentation results produced by different methods, we set various predefined thresholds on the IOU (Intersection over Union) between the predicted tooth instance masks and the ground truth. We consider tooth instance segmentation results to be of high quality when the IOU exceeds the threshold. Fig. 3 presents the comparison of different methods in terms of the completeness of the tooth segmentation results. We vary the IOU threshold from 80% to 90% with a step size of 2% and observe that the completeness
TABLE II
COMPUTATION OVERHEAD COMPARISON WITH STATE-OF-THE-ART METHODS
Fig. 4. Qualitative results compared with state-of-the-art methods. The first row shows segmentation results on a complete set of teeth (excluding the third molars due to the variability in tooth specificity between patients). The second row displays results on a model with one missing tooth, while the third and fourth rows show more challenging cases. Red dotted circles and arrows indicate segmentation errors of recent state-of-the-art methods, while the proposed method maintains better segmentation performance and robustness.
TABLE IV
EFFECT OF THE MASK BRANCH. 'W/O' AND 'W/' INDICATE 'WITHOUT' AND 'WITH', RESPECTIVELY
Fig. 6. Visualization of the highlighted tooth regions (mesh view) when N is set to 30. With the tooth object-aware optimal transport assignment (OOTA) strategy in the training stage, these regions are matched one-to-one with the ground truth teeth. Green boxes with tooth labels indicate tooth instance regions matched with the corresponding ground truth teeth, while gray dashed boxes indicate low-quality regions that do not match any ground truth tooth.
TABLE V
EFFECT OF THE INSTANCE BRANCH

TABLE VI
EFFECT OF THE LOSS FUNCTIONS
TABLE VII
EFFECT OF THE NUMBER OF K

TABLE VIII
EFFECT OF THE FEATURE EMBEDDING SETTINGS

Fig. 7. Comparison of the segmentation performance of different feature encoders under the proposed tooth object affinity module.

6) Effects of the Feature Embedding: Ablation experiments are conducted to assess the impact of different feature embedding settings on tooth segmentation performance, with the results summarized in Table VIII. Notably, using either the coordinate stream or the normal stream in isolation yields less favorable segmentation performance than utilizing coordinates and normal vectors concurrently. The combined use of coordinates and normal vectors proves more effective in conveying positional and structural information during tooth feature extraction. Furthermore, employing two separate streams for coordinate and normal vector feature extraction, as opposed to a single branch for both features, leads to significant improvements in both overall accuracy (OA) and mean intersection over union (mIOU), by 3.24% and 7.71%, respectively. This observation underscores the value of learning these features separately, facilitating the extraction of complementary information that better represents tooth positions and structures.

7) Visualization of the Highlighted Tooth Regions: To further explain how the instance activation maps distinguish teeth, we provide a visualization of the highlighted tooth regions from a 3D tooth model in a scenario where teeth are missing. Fig. 6 highlights the approximate regions of each tooth. We can see that the tooth instance activation maps effectively highlight teeth with different proportions and positions, and they also perform well in challenging cases such as missing teeth.

8) Generalization of the Decoder: To evaluate the generalization ability of the object affinity-based decoder, we conduct experiments using different point cloud feature encoders with it. As depicted in Fig. 7, we show the results with three encoders combined with the proposed decoder. All encoders achieve consistent improvements in OA and mIOU, which indicates the decoder's good generalization ability.

D. Discussion

This paper presents a novel method for 3D dental model segmentation based on a two-stream multi-scale feature encoder and a tooth object affinity module. The proposed method demonstrates superior performance compared to state-of-the-art methods in terms of tooth segmentation accuracy, robustness, and completeness. By leveraging multi-scale feature extraction and incorporating normal vectors, the proposed method mitigates the semantic confusion caused by the similarity of tooth locations and morphologies. This leads to improved segmentation results, especially in cases where adjacent teeth have similar characteristics.

Furthermore, the tooth object affinity module plays a crucial role in accurately identifying tooth categories and delineating tooth boundaries, even in challenging samples with missing teeth or abnormal shapes and sizes. This module effectively captures local and global tooth information and generates reliable tooth instance activation maps, which enables precise segmentation and avoids the inclusion of surrounding gum areas as part of the tooth region.

Experiments are conducted on a benchmark dental dataset, demonstrating the effectiveness and superiority of the proposed method. It outperforms existing methods based on semantic and instance segmentation, showcasing its potential for various dental applications and clinical scenarios.

Future work can focus on enhancing the proposed method by exploring advanced feature encoding techniques and refining the tooth object affinity module to achieve improved accuracy and robustness. Additionally, investigating the generalizability and transferability of the proposed approach to other medical imaging tasks would be valuable for expanding its applicability in the broader field of computer-aided diagnosis and treatment planning.

V. CONCLUSION

This paper presents a novel method for accurate 3D dental model segmentation in a highlighting-and-segmenting manner. The proposed method introduces a tooth object affinity module to highlight the discriminative regions of teeth. The final tooth instance identification and segmentation results can be directly derived using a simple yet effective identification head. Experimental results on a benchmark 3D tooth model dataset demonstrate significant improvements in segmentation performance, surpassing existing methods.

REFERENCES

[1] K. He, G. Gkioxari, P. Dollár, and R. Girshick, "Mask R-CNN," in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), Oct. 2017, pp. 2980–2988.
[2] X. Zhang, H. Li, F. Meng, Z. Song, and L. Xu, "Segmenting beyond the bounding box for instance segmentation," IEEE Trans. Circuits Syst. Video Technol., vol. 32, no. 2, pp. 704–714, Feb. 2022.
[3] Y. Sun, L. Su, S. Yuan, and H. Meng, "DANet: Dual-branch activation network for small object instance segmentation of ship images," IEEE Trans. Circuits Syst. Video Technol., vol. 33, no. 11, pp. 6708–6720, Nov. 2023.
[4] B. Hu, S. Zhou, Z. Xiong, and F. Wu, "Cross-resolution distillation for efficient 3D medical image registration," IEEE Trans. Circuits Syst. Video Technol., vol. 32, no. 10, pp. 7269–7283, Oct. 2022.
[5] R. Nie, J. Cao, D. Zhou, and W. Qian, "Multi-source information exchange encoding with PCNN for medical image fusion," IEEE Trans. Circuits Syst. Video Technol., vol. 31, no. 3, pp. 986–1000, Mar. 2021.
[6] L. Zhao and W. Tao, "JSNet++: Dynamic filters and pointwise correlation for 3D point cloud instance and semantic segmentation," IEEE Trans. Circuits Syst. Video Technol., vol. 33, no. 4, pp. 1854–1867, Apr. 2023.
[7] T. D. Ngo, B.-S. Hua, and K. Nguyen, "ISBNet: A 3D point cloud instance segmentation network with instance-aware sampling and box-aware dynamic convolution," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2023, pp. 13550–13559.
[8] J. Hou, X. Dai, Z. He, A. Dai, and M. Nießner, "Mask3D: Pre-training 2D vision transformers by learning masked 3D priors," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2023, pp. 13510–13519.
[9] F. G. Zanjani et al., "Mask-MCNet: Tooth instance segmentation in 3D point clouds of intra-oral scans," Neurocomputing, vol. 453, pp. 286–298, Sep. 2021.
[10] Z. Cui et al., "TSegNet: An efficient and accurate tooth segmentation network on 3D dental model," Med. Image Anal., vol. 69, Apr. 2021, Art. no. 101949.
[11] L. Qiu, C. Ye, P. Chen, Y. Liu, X. Han, and S. Cui, "DArch: Dental arch prior-assisted 3D tooth instance segmentation with weak annotations," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2022, pp. 20720–20729.
[12] Y. Tian et al., "3D tooth instance segmentation learning objectness and affinity in point cloud," ACM Trans. Multimedia Comput., Commun., Appl., vol. 18, no. 4, pp. 1–16, Nov. 2022.
[13] Y. Zhao et al., "TSASNet: Tooth segmentation on dental panoramic X-ray images by two-stage attention segmentation network," Knowledge-Based Syst., vol. 206, Oct. 2020, Art. no. 106338.
[14] Y. Zheng, B. Chen, Y. Shen, and K. Shen, "TeethGNN: Semantic 3D teeth segmentation with graph neural networks," IEEE Trans. Vis. Comput. Graphics, vol. 29, no. 7, pp. 3158–3168, Jul. 2023.
[15] K. Wu, L. Chen, J. Li, and Y. Zhou, "Tooth segmentation on dental meshes using morphologic skeleton," Comput. Graph., vol. 38, pp. 199–211, Feb. 2014.
[16] B.-J. Zou, S.-J. Liu, S.-H. Liao, X. Ding, and Y. Liang, "Interactive tooth partition of dental mesh base on tooth-target harmonic field," Comput. Biol. Med., vol. 56, pp. 132–144, Jan. 2015.
[17] Z. Li, X. Ning, and Z. Wang, "A fast segmentation method for STL teeth model," in Proc. IEEE/ICME Int. Conf. Complex Med. Eng., May 2007, pp. 163–166.
[18] T. Kronfeld, D. Brunner, and G. Brunnett, "Snake-based segmentation of teeth from virtual dental casts," Comput.-Aided Design Appl., vol. 7, no. 2, pp. 221–233, Jan. 2010.
[19] Y. Kumar, R. Janardan, B. Larson, and J. Moon, "Improved segmentation of teeth in dental models," Comput.-Aided Design Appl., vol. 8, no. 2, pp. 211–224, Jan. 2011.
[20] C. Sinthanayothin and W. Tharanont, "Orthodontics treatment simulation by teeth segmentation and setup," in Proc. 5th Int. Conf. Electr. Eng./Electron., Comput., Telecommun. Inf. Technol., May 2008, pp. 81–84.
[21] M. Yaqi and L. Zhongke, "Computer aided orthodontics treatment by virtual segmentation and adjustment," in Proc. Int. Conf. Image Anal. Signal Process., Apr. 2010, pp. 336–339.
[22] D. Sun et al., "Automatic tooth segmentation and dense correspondence of 3D dental model," in Proc. Int. Conf. Med. Image Comput. Comput.-Assisted Intervent. Cham, Switzerland: Springer, 2020, pp. 703–712.
[23] T. Kondo, S. H. Ong, and K. W. C. Foong, "Tooth segmentation of dental study models using range images," IEEE Trans. Med. Imag., vol. 23, no. 3, pp. 350–362, Mar. 2004.
[24] X. Xu, C. Liu, and Y. Zheng, "3D tooth segmentation and labeling using deep convolutional neural networks," IEEE Trans. Vis. Comput. Graphics, vol. 25, no. 7, pp. 2336–2348, Jul. 2019.
[25] J. Zhang, C. Li, Q. Song, L. Gao, and Y.-K. Lai, "Automatic 3D tooth segmentation using convolutional neural networks in harmonic parameter space," Graph. Models, vol. 109, May 2020, Art. no. 101071.
[26] F. G. Zanjani et al., "Deep learning approach to semantic segmentation in 3D point cloud intra-oral scans of teeth," in Proc. Int. Conf. Med. Imag. With Deep Learn., 2019, pp. 557–571.
[27] C. Lian et al., "Deep multi-scale mesh feature learning for automated labeling of raw dental surfaces from 3D intraoral scanners," IEEE Trans. Med. Imag., vol. 39, no. 7, pp. 2440–2450, Jul. 2020.
[28] L. Zhang et al., "TSGCNet: Discriminative geometric feature learning with two-stream graph convolutional network for 3D dental model segmentation," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2021, pp. 6695–6704.
[29] J. Yu, M. Tan, H. Zhang, Y. Rui, and D. Tao, "Hierarchical deep click feature prediction for fine-grained image recognition," IEEE Trans. Pattern Anal. Mach. Intell., vol. 44, no. 2, pp. 563–578, Feb. 2022.
[30] Z. Liu et al., "Hierarchical self-supervised learning for 3D tooth segmentation in intra-oral mesh scans," IEEE Trans. Med. Imag., vol. 42, no. 2, pp. 467–480, Feb. 2023.
[31] J. Zhang, J. Yang, J. Yu, and J. Fan, "Semisupervised image classification by mutual learning of multiple self-supervised models," Int. J. Intell. Syst., vol. 37, no. 5, pp. 3117–3141, May 2022.
[32] J. Yu, Y. Rui, and D. Tao, "Click prediction for web image reranking using multimodal sparse coding," IEEE Trans. Image Process., vol. 23, no. 5, pp. 2019–2032, May 2014.
[33] H. W. Kuhn, "The Hungarian method for the assignment problem," Nav. Res. Logistics (NRL), vol. 52, no. 1, pp. 7–21, Feb. 2005.
[34] J. Wang, L. Song, Z. Li, H. Sun, J. Sun, and N. Zheng, "End-to-end object detection with fully convolutional network," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2021, pp. 15844–15853.
[35] X. Zhu, W. Su, L. Lu, B. Li, X. Wang, and J. Dai, "Deformable DETR: Deformable transformers for end-to-end object detection," 2020, arXiv:2010.04159.
[36] T. Cheng et al., "Sparse instance activation for real-time instance segmentation," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2022, pp. 4423–4432.
[37] L. Yi, W. Zhao, H. Wang, M. Sung, and L. J. Guibas, "GSPN: Generative shape proposal network for 3D instance segmentation in point cloud," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2019, pp. 3942–3951.
[38] B. Yang et al., "Learning object bounding boxes for 3D instance segmentation on point clouds," in Proc. Adv. Neural Inf. Process. Syst., vol. 32, 2019, pp. 6737–6746.
[39] J. Hou, A. Dai, and M. Nießner, "3D-SIS: 3D semantic instance segmentation of RGB-D scans," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2019, pp. 4416–4425.
[40] T. Sun, G. Liu, R. Li, S. Liu, S. Zhu, and B. Zeng, "Quadratic terms based point-to-surface 3D representation for deep learning of point cloud," IEEE Trans. Circuits Syst. Video Technol., vol. 32, no. 5, pp. 2705–2718, May 2022.
[41] W. Wang, R. Yu, Q. Huang, and U. Neumann, "SGPN: Similarity group proposal network for 3D point cloud instance segmentation," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., Jun. 2018, pp. 2569–2578.
[42] J. Zhang, Y. Cao, and Q. Wu, "Vector of locally and adaptively aggregated descriptors for image feature representation," Pattern Recognit., vol. 116, Aug. 2021, Art. no. 107952.
[43] L. Jiang, H. Zhao, S. Shi, S. Liu, C.-W. Fu, and J. Jia, "PointGroup: Dual-set point grouping for 3D instance segmentation," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2020, pp. 4866–4875.
[44] T. Vu, K. Kim, T. M. Luu, T. Nguyen, and C. D. Yoo, "SoftGroup for 3D instance segmentation on point clouds," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2022, pp. 2698–2707.
[45] R. Q. Charles, H. Su, M. Kaichun, and L. J. Guibas, "PointNet: Deep learning on point sets for 3D classification and segmentation," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), Jul. 2017, pp. 77–85.
[46] C. R. Qi et al., "PointNet++: Deep hierarchical feature learning on point sets in a metric space," in Proc. Adv. Neural Inf. Process. Syst., vol. 30, 2017.
[47] Y. Wang, Y. Sun, Z. Liu, S. E. Sarma, M. M. Bronstein, and J. M. Solomon, "Dynamic graph CNN for learning on point clouds," ACM Trans. Graph., vol. 38, no. 5, pp. 1–12, Oct. 2019.
[48] R. Stewart, M. Andriluka, and A. Y. Ng, "End-to-end people detection in crowded scenes," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2016, pp. 2325–2333.
[49] T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár, "Focal loss for dense object detection," in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), Oct. 2017, pp. 2999–3007.
[50] P. Li et al., "Semantic graph attention with explicit anatomical association modeling for tooth segmentation from CBCT images," IEEE Trans. Med. Imag., vol. 41, no. 11, pp. 3116–3127, Nov. 2022.
[51] A. Ben-Hamadou et al., "Teeth3DS: A benchmark for teeth segmentation and labeling from intra-oral 3D scans," 2022, arXiv:2210.06094.
[52] M. Corsini, P. Cignoni, and R. Scopigno, "Efficient and flexible sampling with blue noise properties of triangular meshes," IEEE Trans. Vis. Comput. Graphics, vol. 18, no. 6, pp. 914–924, Jun. 2012.
[53] Y. Zhao et al., "Two-stream graph convolutional network for intra-oral scanner image segmentation," IEEE Trans. Med. Imag., vol. 41, no. 4, pp. 826–835, Apr. 2022.
[54] C. Lian et al., "MeshSNet: Deep multi-scale mesh feature learning for end-to-end tooth labeling on 3D dental surfaces," in Proc. Int. Conf. Med. Image Comput. Comput.-Assist. Intervent., Shenzhen, China, Oct. 2019, pp. 837–845.

Pengcheng Li received the B.S. and M.S. degrees from the School of Communications and Information Engineering, Chongqing University of Posts and Telecommunications, China, in 2017 and 2020, respectively, where he is currently pursuing the Ph.D. degree. His research interests include medical image processing, computer vision, and deep learning.

Chenqiang Gao received the B.S. degree in computer science from the China University of Geosciences, Wuhan, China, in 2004, and the Ph.D. degree in control science and engineering from the Huazhong University of Science and Technology, Wuhan, in 2009. In August 2009, he joined the School of Communications and Information Engineering, Chongqing University of Posts and Telecommunications (CQUPT), Chongqing, China. In September 2012, he joined the Informedia Group, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, where he was a Visiting Scholar working on multimedia event detection (MED) and surveillance event detection (SED). In April 2013, he became a Post-Doctoral Fellow and continued work on MED and SED until March 2014, when he returned to CQUPT. He is currently a Professor with CQUPT. His research interests include image processing, infrared target detection, action recognition, and event detection.

Fangcen Liu received the B.S. and M.S. degrees from the School of Communications and Information Engineering, Chongqing University of Posts and Telecommunications, China, in 2018 and 2021, respectively, where she is currently pursuing the Ph.D. degree. Her research interests include image processing, deep learning, cross-modal retrieval, and infrared small target detection.

Deyu Meng (Senior Member, IEEE) received the B.Sc., M.Sc., and Ph.D. degrees from Xi'an Jiaotong University, Xi'an, China, in 2001, 2004, and 2008, respectively. He was a Visiting Scholar with Carnegie Mellon University, Pittsburgh, PA, USA, from 2012 to 2014. He is currently a Professor with the School of Mathematics and Statistics, Xi'an Jiaotong University, and an Adjunct Professor with the Faculty of Information Technology, Macau University of Science and Technology, Taipa, Macau, China. His research interests include model-based deep learning, variational networks, and meta learning.

Yan Yan (Senior Member, IEEE) received the Ph.D. degree in computer science from the University of Trento. He was an Assistant Professor with Texas State University and a Research Fellow with the University of Michigan and the University of Trento. He is currently a Gladwin Development Chair Assistant Professor with the Department of Computer Science, Illinois Institute of Technology. His research interests include computer vision, machine learning, and multimedia.