23 points | by pizza21 hours ago
Are UCNP heterostructures also the most promising area of application, or are you looking to apply the method in other areas now that you've validated it in the stringent case?
First off, the ML surrogate is many orders of magnitude faster than direct kMC simulation. Each global optimization involved up to 500 sequential predictions of emission intensity with different nanoparticle structures (and gradients - I'll come back to those). For the largest particles that we optimized, running kMC on only the final optimized structure (for validation) took three months! So running 500 sequential kMC simulations would take approximately 125 years - not ideal. In contrast, all 81 global optimizations took two days with our trained hetero-GNN.
Back to the gradients - running the kMC simulations involves generating four different randomly doped 3D nanoparticle structures according to the shell thicknesses and dopant concentrations for the structure of interest, and then running four kMC trajectories with different random seeds for each structure. The final predicted value for emission (based on the number of emitted photons within the energy range of interest) is averaged over all sixteen trajectories. kMC is fundamentally stochastic, and not differentiable. But even if you could make it so with the adjoint-variable method (which I am admittedly not super familiar with), or maybe by writing your kMC code in a language with foundational auto-diff capabilities, once you move from a description of layer thicknesses and dopant concentrations to an ensemble of 3D structures/trajectories, I still don't think that the gradient of emission with respect to layer thicknesses and dopant concentrations would be accessible.
Thoughts? Thanks!
> So running 500 sequential kMC simulations would take approximately 125 years - not ideal.
Ah I see. Ya it’s hard to get around this…
> is fundamentally stochastic, and not differentiable
This is actually a common “inverse design pattern” for a variety of applications, and luckily there are tricks to efficiently compute gradients here. In my domain (nanophotonics) we’re often simulating incoherent sources, which are similarly stochastic. But there are ways to reformulate the problem to drastically reduce the number of forward and adjoint solves you need during the design process [1].
That being said, given the cost of the forward problem (3 months per iteration!) this doesn’t help you much…