当前位置: X-MOL 学术Mach. Learn. Sci. Technol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Warwick electron microscopy datasets
Machine Learning: Science and Technology ( IF 6.013 ) Pub Date : 2020-09-18 , DOI: 10.1088/2632-2153/ab9c3c
Jeffrey M Ede

Large, carefully partitioned datasets are essential to train neural networks and standardize performance benchmarks. As a result, we have set up new repositories to make our electron microscopy datasets available to the wider community. There are three main datasets containing 19769 scanning transmission electron micrographs, 17266 transmission electron micrographs, and 98340 simulated exit wavefunctions, and multiple variants of each dataset for different applications. To visualize image datasets, we trained variational autoencoders to encode data as 64-dimensional multivariate normal distributions, which we cluster in two dimensions by t-distributed stochastic neighbor embedding. In addition, we have improved dataset visualization with variational autoencoders by introducing encoding normalization and regularization, adding an image gradient loss, and extending t-distributed stochastic neighbor embedding to account for encoded standard deviations. Our datasets, source code, pr...

中文翻译:

Warwick电子显微镜数据集

精心划分的大型数据集对于训练神经网络和标准化性能基准至关重要。因此,我们建立了新的存储库,以使我们的电子显微镜数据集可用于更广泛的社区。有三个主要数据集,分别包含19769扫描透射电子显微照片,17266透射电子显微照片和98340模拟出射波函数,以及每个数据集针对不同应用的多种变体。为了可视化图像数据集,我们训练了变分自动编码器,将数据编码为64维多元正态分布,并通过t分布随机邻居嵌入将其二维聚类。此外,我们通过引入编码归一化和正则化,通过变分自动编码器改善了数据集的可视化效果,增加图像梯度损失,并扩展t分布随机邻居嵌入以说明编码标准差。我们的数据集,源代码,程序...
更新日期:2020-09-20
down
wechat
bug