手把手复现DuDoDp：基于PyTorch和DeepLesion数据集的CT金属伪影消除实战教程

张

张建站

2026/5/4 4:10:19

10分钟阅读

手把手复现DuDoDp：基于PyTorch和DeepLesion数据集的CT金属伪影消除实战教程

基于PyTorch的CT金属伪影消除实战从DeepLesion到双域扩散模型医疗影像中的金属伪影问题一直是CT图像处理领域的痛点。当患者体内存在金属植入物时重建后的CT图像会出现严重的条纹状伪影严重影响诊断准确性。传统方法如线性插值和归一化金属伪影减少NMAR虽然简单有效但在复杂场景下表现有限。近年来深度学习技术为这一问题带来了新的解决方案但大多数方法依赖于大量配对数据训练而临床环境中获取这类数据成本极高。1. 环境准备与数据预处理1.1 硬件与软件配置要实现高效的CT金属伪影消除合理的硬件配置和软件环境至关重要。推荐使用NVIDIA RTX 3090或更高性能的GPU配备至少24GB显存。以下是推荐的开发环境配置# 创建conda环境 conda create -n dudodp python3.8 conda activate dudodp # 安装PyTorch及相关库 pip install torch1.12.1cu113 torchvision0.13.1cu113 -f https://download.pytorch.org/whl/torch_stable.html pip install numpy1.23.4 scipy1.9.3 matplotlib3.6.1 pip install tqdm4.64.1 scikit-image0.19.3对于CT图像处理还需要安装专门的医学影像处理库# 安装医学影像专用库 pip install pydicom2.3.1 SimpleITK2.2.11.2 DeepLesion数据集处理DeepLesion数据集包含大量CT扫描图像是训练金属伪影消除模型的理想选择。数据集预处理流程包括以下几个关键步骤数据下载与解压从NIH官网获取DeepLesion数据集解压后得到DICOM格式文件格式转换将DICOM转换为更易处理的NIfTI或PNG格式数据筛选排除含有金属伪影的图像确保训练集纯净数据增强应用旋转、翻转等变换增加数据多样性import pydicom import numpy as np from skimage import exposure def dicom_to_npy(dicom_path): 将DICOM文件转换为numpy数组并进行标准化 dicom pydicom.dcmread(dicom_path) img dicom.pixel_array.astype(np.float32) # 应用CT窗口化 img exposure.rescale_intensity(img, in_range(dicom.WindowCenter-dicom.WindowWidth/2, dicom.WindowCenterdicom.WindowWidth/2), out_range(0, 1)) return img2. 双域扩散模型架构设计2.1 U-Net基础架构扩散模型的核心是U-Net架构它能有效捕捉图像的局部和全局特征。我们的实现基于PyTorch主要包含下采样和上采样两部分import torch import torch.nn as nn class DoubleConv(nn.Module): U-Net中的双卷积块 def __init__(self, in_channels, out_channels): super().__init__() self.double_conv nn.Sequential( nn.Conv2d(in_channels, out_channels, kernel_size3, padding1), nn.BatchNorm2d(out_channels), nn.ReLU(inplaceTrue), nn.Conv2d(out_channels, out_channels, kernel_size3, padding1), nn.BatchNorm2d(out_channels), nn.ReLU(inplaceTrue) ) def forward(self, x): return self.double_conv(x) class DownBlock(nn.Module): 下采样块 def __init__(self, in_channels, out_channels): super().__init__() self.maxpool_conv nn.Sequential( nn.MaxPool2d(2), DoubleConv(in_channels, out_channels) ) def forward(self, x): return self.maxpool_conv(x)2.2 扩散过程实现扩散模型包含前向扩散和反向去噪两个过程。前向过程逐渐向图像添加高斯噪声反向过程则学习去除噪声class DiffusionModel(nn.Module): def __init__(self, T1000, beta_start1e-4, beta_end0.02): super().__init__() self.T T self.betas torch.linspace(beta_start, beta_end, T) self.alphas 1. - self.betas self.alpha_bars torch.cumprod(self.alphas, dim0) def forward_diffusion(self, x0, t): 前向扩散过程 noise torch.randn_like(x0) alpha_bar_t self.alpha_bars[t].view(-1, 1, 1, 1) xt torch.sqrt(alpha_bar_t) * x0 torch.sqrt(1 - alpha_bar_t) * noise return xt, noise def reverse_process(self, model, xt, t): 反向去噪过程 pred_noise model(xt, t) alpha_t self.alphas[t].view(-1, 1, 1, 1) alpha_bar_t self.alpha_bars[t].view(-1, 1, 1, 1) beta_t self.betas[t].view(-1, 1, 1, 1) x0_pred (xt - torch.sqrt(1 - alpha_bar_t) * pred_noise) / torch.sqrt(alpha_bar_t) x0_pred torch.clamp(x0_pred, -1., 1.) if t[0] 0: return x0_pred else: noise torch.randn_like(xt) mean (xt - beta_t * pred_noise / torch.sqrt(1 - alpha_bar_t)) / torch.sqrt(alpha_t) variance torch.sqrt(beta_t) return mean variance * noise3. 双域迭代修复流程3.1 弦图域修复弦图sinogram是CT投影数据的原始表示形式。金属伪影在弦图中表现为数据缺失区域。我们的修复流程首先在弦图域进行def sinogram_inpainting(sinogram, metal_mask, fbp_operator, model, t): 弦图域修复 :param sinogram: 含金属伪影的弦图 :param metal_mask: 金属影响区域的二值掩模 :param fbp_operator: 滤波反投影算子 :param model: 预训练扩散模型 :param t: 当前时间步 # 从扩散模型获取先验图像 prior_img model.predict_prior(sinogram, t) # 计算先验弦图 prior_sinogram fbp_operator.forward_project(prior_img) # 补全受金属影响的区域 completed_sinogram torch.where(metal_mask 0.5, prior_sinogram, sinogram) # 重建图像 reconstructed_img fbp_operator.back_project(completed_sinogram) return reconstructed_img, completed_sinogram3.2 图像域融合弦图域修复后我们进一步在图像域进行融合以消除可能残留的伪影def image_domain_fusion(reconstructed_img, prior_img, metal_art_img, t, T): 图像域融合 :param reconstructed_img: 弦图修复后的重建图像 :param prior_img: 扩散模型生成的先验图像 :param metal_art_img: 原始含金属伪影的图像 :param t: 当前时间步 :param T: 总时间步数 # 计算动态权重 alpha 0.5 * (1 np.cos(np.pi * t / T)) # 融合先验图像和重建图像 fused_img alpha * prior_img (1 - alpha) * reconstructed_img # 进一步融合金属伪影图像 final_img 0.7 * fused_img 0.3 * metal_art_img return final_img4. 动态权重掩码实现动态权重掩码是双域方法的关键创新它能根据金属伪影的严重程度自适应调整修复强度class DynamicWeightMask(nn.Module): def __init__(self, initial_weight0.5, decay_rate0.95): super().__init__() self.initial_weight initial_weight self.decay_rate decay_rate def forward(self, metal_mask, t, max_t): 生成动态权重掩码 :param metal_mask: 金属区域掩模 :param t: 当前时间步 :param max_t: 最大时间步 # 基础权重 base_weight self.initial_weight * (self.decay_rate ** (t / max_t * 10)) # 生成空间变化的权重图 distance_map self._compute_distance_map(metal_mask) weight_map base_weight * (1 0.5 * torch.sigmoid(5 - distance_map)) return weight_map def _compute_distance_map(self, mask): 计算每个像素到金属区域的距离 from scipy.ndimage import distance_transform_edt mask_np mask.squeeze().cpu().numpy() dist_map distance_transform_edt(1 - mask_np) return torch.from_numpy(dist_map).float().to(mask.device)5. 完整训练与推理流程5.1 模型训练训练过程分为两个阶段首先训练扩散模型然后微调双域修复流程def train_diffusion(model, dataloader, optimizer, device, epochs): model.train() for epoch in range(epochs): for i, (clean_imgs, _) in enumerate(dataloader): clean_imgs clean_imgs.to(device) # 随机采样时间步 t torch.randint(0, model.T, (clean_imgs.size(0),), devicedevice) # 前向扩散过程 noisy_imgs, noise model.forward_diffusion(clean_imgs, t) # 预测噪声 pred_noise model.reverse_process(noisy_imgs, t) # 计算损失 loss nn.MSELoss()(pred_noise, noise) # 反向传播 optimizer.zero_grad() loss.backward() optimizer.step() if i % 100 0: print(fEpoch [{epoch1}/{epochs}], Step [{i}/{len(dataloader)}], Loss: {loss.item():.4f})5.2 推理流程完整的金属伪影消除推理流程如下def inference_pipeline(metal_art_img, metal_mask, model, fbp_operator, T100): 完整推理流程 :param metal_art_img: 含金属伪影的CT图像 :param metal_mask: 金属区域掩模 :param model: 预训练扩散模型 :param fbp_operator: 滤波反投影算子 :param T: 扩散步数 # 初始化 xt torch.randn_like(metal_art_img) results [] # 获取原始弦图 original_sinogram fbp_operator.forward_project(metal_art_img) # 迭代去噪 for t in reversed(range(T)): # 弦图域修复 recon_img, completed_sino sinogram_inpainting( original_sinogram, metal_mask, fbp_operator, model, t ) # 获取扩散先验 prior_img model.predict_prior(xt, t) # 图像域融合 fused_img image_domain_fusion( recon_img, prior_img, metal_art_img, t, T ) # 更新xt xt model.reverse_process(fused_img, t) results.append(fused_img.detach().cpu()) return results[-1] # 返回最终结果在实际项目中我们发现金属伪影消除的效果很大程度上取决于金属掩模的准确性。使用阈值法生成的掩模往往不够精确而采用深度学习分割网络如U-Net可以显著提高掩模质量。此外对于不同解剖部位如头部、腹部、骨盆可能需要调整动态权重掩码的参数以获得最佳效果。

Vue 3项目从零到上线：除了npm install，你还需要配置这些（Node.js v22.4.1环境）

Vue 3项目从零到上线：除了npm install，你还需要配置这些（Node.js v22.4.1环境） 在Node.js v22.4.1环境下启动一个Vue 3项目，远不止运行npm install那么简单。现代前端工程化要求开发者掌握从脚手架选择到生产环境优化的…...

2026/5/4 4:05:25 阅读更多 →

ESP32 Marauder 5G - Apex 5模块：无线安全研究的革新利器

1. ESP32 Marauder 5G - Apex 5模块深度解析作为Flipper Zero生态中最新推出的多功能射频模块，ESP32 Marauder 5G - Apex 5代表了当前开源硬件在无线安全研究领域的最高集成度。这款由HoneyHoneyTrading设计的扩展模块，通过ESP32-C5芯片实现了前所未有的…...

2026/5/4 3:57:27 阅读更多 →

别再只用Matplotlib了！用PyEcharts在VSCode里5分钟搞定动态交互图表（附完整代码）

用PyEcharts在VSCode中5分钟打造专业级交互式数据看板当数据分析师还在为Matplotlib的静态图表反复调整参数时，前沿团队早已用PyEcharts构建出可实时交互的商业看板。这不是工具迭代，而是可视化思维的代际跨越——在VSCode中，仅需5分钟就能将…...

2026/5/4 3:54:39 阅读更多 →

C语言RTOS多核协同失效真相：Cache一致性缺失、内存序乱序、GCC -O2优化陷阱——三重危机诊断工具链实战

更多请点击： https://intelliparadigm.com 第一章：C语言RTOS多核协同失效的系统性认知在嵌入式实时系统中，基于C语言开发的RTOS（如FreeRTOS、Zephyr或RT-Thread）常被移植至ARM Cortex-A/R系列或多核RISC-V SoC平台。…...

2026/5/3 0:03:42 阅读更多 →

Zotero GPT终极指南：用AI轻松读懂学术文献的研究态度与情感倾向

Zotero GPT终极指南：用AI轻松读懂学术文献的研究态度与情感倾向【免费下载链接】zotero-gpt GPT Meet Zotero. 项目地址: https://gitcode.com/gh_mirrors/zo/zotero-gpt 你是否曾被海量学术文献淹没？是否在阅读论文时难以快速把握作者的研究立场…...

2026/5/3 0:03:57 阅读更多 →