3260 papers • 126 benchmarks • 313 datasets
U-Net is an architecture for semantic segmentation. It consists of a contracting path (Up to down) and an expanding path (Down to up). During the contraction, the spatial information is reduced while feature information is increased. The contracting path follows the typical architecture of a convolutional network. It consists of the repeated application of two 3x3 convolutions (unpadded convolutions), each followed by a rectified linear unit (ReLU) and a 2x2 max pooling operation with stride 2 for downsampling. At each downsampling step, we double the number of feature channels. Every step in the expansive path consists of an upsampling of the feature map followed by a 2x2 convolution (“up-convolution”) that halves the number of feature channels, a concatenation with the correspondingly cropped feature map from the contracting path, and two 3x3 convolutions, each followed by a ReLU. The cropping is necessary due to the loss of border pixels in every convolution. At the final layer, a 1x1 convolution is used to map each 64-component feature vector to the desired number of classes. In total the network has 23 convolutional layers.
(Image credit: Papersgraph)
These leaderboards are used to track progress in unet-segmentation-10
Use these libraries to find unet-segmentation-10 models and implementations
No subtasks available.
The proposed UCTransNet is able to replace the original skip connection to solve the semantic gaps for an accurate automatic medical image segmentation and achieves consistent improvements over the state-of-the-art for semantic segmentation across different datasets and conventional architectures involving transformer or U-shaped framework.
3D Dense-U-Net neural network architecture implementing densely connected layers is introduced, which can process image data in 3D and has achieved accuracy of 99.72% on MRI brain dataset, which outperformed results achieved by human expert.
Automatic adipose tissue segmentation is feasible in 3D whole-body MR data sets and is generalizable to different epidemiological cohort studies with the proposed DCNet.
The Generally Nuanced Deep Learning Framework (GaNDLF) is proposed, which aims to provide an end-to-end solution for all DL-related tasks, to tackle problems in medical imaging and provide a robust application framework for deployment in clinical workflows.
It is shown that even a lightweight architecture is capable to perform on a range of complex textures and is able to distinguish the holes drilling operations’ semantical information from the rest of the furniture board and conveyor context.
The approach is able to accurately detect crop classes with an overall accuracy of about 93% and that the network can detect discriminatory time intervals in which crop is cultivated and provide further information to the end-user.
A machine learning-based method for extracting H i sources from the 3D spectral data obtained from the Commensal Radio Astronomy FAST Survey (CRAFTS), which utilizes the advanced 3D-Unet segmentation architecture and employs an elongated convolution kernel to effectively capture the intricate structures of H i sources.
A structured approach to introduce spatially dynamic components to the ViT-UNet enables the model to effectively capture features of target objects with diverse appearances, and is integrated into a novel architecture, termed AgileFormer.
Adding a benchmark result helps the community track progress.