We experimentally demonstrated a fast optimization algorithm based on the method of gradient descent for
achieving optimum spectral response of high-order silicon microring optical filters. The filter optimization was
performed on a 4th-order serially-coupled silicon microring filter by thermo-optically tuning the microring resonances using Ti/W heaters. Three different optimization objective functions were used to obtain the optimum
filter shape, namely, the single-wavelength method, the dual-wavelength method, and the total transmitted power
method. The efficacy of each optimization method was evaluated and compared based on the number of required
iterations, the ideality of optimized response, and the wavelength tuning accuracy.
Fast Thermo-Optic Optimization of SOI Microring Filters
1. Fast Thermo-Optic Optimization of
High-Order SOI Microring Optical Filters
by Method of Gradient Descent
Tyler J. Zimmerling, Yang Ren, Huynh Ngo, Vien Van
University of Alberta, Edmonton, Alberta, Canada
3. Motivation 2
2nd Order Systems
Tune 1 resonance to the other
Higher Order Systems
Complex, co-ordinate descent, Nelder-Mead algorithm
Filter shape, in-band ripple
Investigate and compare 3 different optimization objective
functions:
Single-Wavelength
Dual-Wavelength
Total Transmitted Power
9. Total Transmitted Power Method 8
12345678910
Φ 𝑇𝑇𝑃 =
𝜆3𝑑𝐵−
𝜆3𝑑𝐵+
𝑇 𝜆
𝑁
Trials 1-10
Black lines: optimization region,
1546.95 to 1547.15 nm.
Dashed cyan spectrum: initial
response after coordinate
descent.
10. Results I 9
Single-λ and TTP converge ~5 iterations
Dual-λ fluctuates around optimized result
TTP: 20% decrease in 3 dB bandwidth
Dual-λ: Outperforms in center wavelength accuracy with ∆λ = 10𝑝𝑚
Single-λ: Least in-band ripple at 0.53 dB
12. Conclusions and Future Work 11
Optimization Method Bandwidth (GHz) Δλ (pm) In-Band Ripple (dB) Iterations
Single-Wavelength 25.1 30 0.53 5
Dual-Wavelength 25.1 10 2.36 9
TTP 20.0 50 2.56 5
Single-Wavelength: Worst center λ accuracy, best in-band ripple
Dual-Wavelength: Best center λ accuracy, significant in-band ripple
TTP: Most stable, worst filter shape and in-band ripple
Focus on In-Band Ripple
Order of Optimization
Weighted Objective Functions
Editor's Notes
**Good morning ladies and gentlemen, my name is Tyler Zimmerling.
**I am a Master’s student in Electrical and Computer Engineering at the University of Alberta in Edmonton, Alberta, Canada.
**Today I will be presenting our demonstration of fast thermo-optic optimization of high-order silicon-on-insulator microring optical filters using the method of gradient descent.
**Microring optical filters are a subset of devices that belong to the category of photonic integrated circuits.
**Recently, silicon photonics has become the dominant technology for realizing these PICs, especially for optical communication and signal processing applications, such as Wavelength Division Multiplexing.
**The inherent fabrication errors and defects in the manufacturing of these PICs makes post-fabrication optimization necessary.
**This can be achieved through the thermo-optic effect using heaters fabricated on top of the devices, by direct resistive heating of doped silicon waveguides, or by free carrier injection.
**The process is straightforward for 2-ring systems as one can simply tune the resonant frequency of one ring until it lines up with the other.
**With increasing filter order, tuning the microring resonances becomes complex, requiring tuning method, such as coordinate descent or the Nelder-Mead simplex algorithm.
**Additionally, filter shapes with designed in-band ripple require further consideration for effective optimization.
**These methods typically require that the resonant frequencies are first roughly aligned to obtain some initial spectral response to be optimized.
**In previous studies the objective function is defined by maximizing the optical transmission at the desired center wavelength.
**We aim to investigate and compare the performance of the optimization method with respect to various filter characteristics using three different optimization objective functions:
**Single-wavelength method, which maximizes transmitted optical power at the center wavelength of the passband
**Dual-wavelength method, which maximizes the average of powers at two wavelengths tuned approximately to the 3dB bandwidth points
**Total transmitted power (TTP) method, which maximizes the average of the power spectrum across the passband.
**We applied each method to optimize the filter response of a 4th-order Vernier microring filter design.
**The filter design was a flat-top Vernier filter with a 3dB bandwidth of 25 GHz and an FSR of 158 nm for TE polarization.
**To achieve the Vernier effect, the first two microrings had radius 11.272 um and the last two microrings had radius 14.090 um.
**The device was implemented using strip waveguides 450 nm wide on an SOI substrate with a 220 nm silicon layer and a 2 um SiO2 buffer.
**Heaters made of Ti/W with 3 um width and 200 nm thickness were fabricated 2.2 um above the Si waveguides.
**The method of gradient descent is a type of line-search optimization which chooses the step direction and step size based on the local gradient of the objective function.
**At each step, the local slope of the objective function with respect to each heating current was calculated using the five-point discrete derivative formula, T’(I).
**Using the local derivatives of the objective function with respect to the four heater currents, we computed the normalized gradient vector, which was then used to determine the next current settings.
**The three objective functions are defined as follows:
**For each objective function, we computed the error function at each iteration defined as:
**This error value becomes less and less as the relative improvement between successive iterations diminishes.
**One of the benefits of the TTP method is that a broadband source can be used for the optimization instead of using a tunable laser, thus greatly speeding up measurement in each iteration.
**Furthermore, objective functions like the dual-wavelength and TTP can take into account in-band ripples in the filter response.
**In contrast, the single-wavelength is more suitable for optimizing filter responses with no dips (ripples) in the passband.
**The spectral responses of the microring filter was measured using a tunable laser and a polarization controller
**The optical signal was then butt-coupled into and out of the silicon chip facet using lensed fibres and measured using a photodetector with a wavelength sampling step of 10 pm.
**Each tuning circuit consisted of a computer-connected microcontroller and a digial-to-analog converter which output a voltage to the resistive elements, with a resistance of around 200 $\Omega$.
**The maximum applied voltage was 3 V, which resulted in a maximum measured current of $\backsim$15 mA and a maximum applied power of 45 mW.
**The microrings had a tuning efficiency of about 80 pm/mW, the heater setup had an uncertainty of about \textpm 0.1 mW, giving our thermal tuning an uncertainty of approximately \textpm 8 pm.
**Our optimization started with a baseline scan of the filter spectrum with all currents set to half the operational range to allow for tuning in both directions, followed by coarse alignment through coordinate descent.
**Gradient descent optimization was then applied to the approximately aligned filter spectrum.
**We set the target wavelength to 1547.05 nm and performed the gradient descent optimization.
**The single black line indicates the target wavelength that contributed to the objective function.
**The initial spectrum after coarse alignment is shown by the dashed cyan spectrum in each plot.
**The single-wavelength method converged in approximately five iterations, with small heating adjustments made in the subsequent iterations as the tuning current step size decreased.
**The optimized filter response had a bandwidth of 25.1 GHz, a center wavelength detuning of 30 pm, and an in-band ripple of 0.53 dB.
**Due to variation in fabrication, the coupling-coefficients $\kappa_i$ have resulted in a spectrum with ripple.
**While resonant frequency tuning can improve filter response, in this case the spectrum we optimize has inherent ripple. TALK ABOUT DEVICE RIPPLE
**We set the target wavelengths 1546.95 and 1547.15 nm, indicating a 0.2 nm passband (or 25.05 GHz) and performed the gradient descent optimization.
**The two black lines indicate the two wavelength measurements, whose sum contributed to the objective function.
**The dual-wavelength method obtained an optimized bandwidth of 25.1 GHz, a center wavelength detuning of 10 pm, and an in-band ripple of 2.36 dB.
**The dual-wavelength optimization method took about nine iterations to converge. The method yielded much better center-wavelength accuracy than the single-wavelength method.
**We set the target wavelength range 1546.95 to 1547.15 nm and performed the gradient descent optimization.
**The rectangle defined by the black lines indicates the range of wavelengths whose sum contributed to the objective function.
**The TTP method obtained an optimized bandwidth of 20.0 GHz, a center wavelength detuning of 50 pm, and an in-band ripple of 2.56 dB.
**The number of samples, N, was 12.
**The TTP method objective function converged in about five iterations.
**Subsequent iterations alternated between a single-peaked result and a flatter, dual-peaked result.
**The first figure depicts the objective functions Phi_single, Phi_dual, and Phi_TTP for the filter as functions of the optimization iterations.
**Starting from the coordinate descent optimization, the single-wavelength and TTP cases rapidly converge under the gradient descent optimization, while the dual wavelength fluctuates around the optimized result.
**The TTP method achieved a stable optimization state the fastest, with each further iteration only slightly increasing the objective function value.
**In the second figure, we plot the 3dB bandwidths of the optimized spectra of the filter for each optimization method.
**The plot demonstrates that the TTP method yielded a 3dB bandwidth that was markedly different (20\% decrease) from the value obtained by both the single-wavelength and dual-wavelength methods, which achieved identical 3dB bandwidths of 25.1 GHz.
**The third figure plots the center wavelength detuning of each optimized response versus the optimization methods.
**The dual-wavelength optimization method clearly outperforms in terms of center wavelength accuracy with a detune of only 10 pm.
**The final figure shows the in-band ripple of the optimized spectra for each optimization method.
**The center wavelength accuracy of the dual-wavelength method comes with the trade-off of increased in-band ripple.
**The increased in-band ripple is to be expected as the dual-wavelength method artificially generates in-band ripple as we try to create two peaks at the edges of the passband.
**This figure plots the initial, ideal, and optimized spectra for all three optimization methods. The initial spectra is provided in cyan, the ideal is dashed black, single wavelength is blue, dual is red, and TTP is green.
**For this device, the single-wavelength method yielded the flattest passband with the smallest in-band ripple, while the dual-wavelength method provided the best center wavelength tuning accuracy.
**For our 4th-order Vernier microring filter with slight inherent ripples in the passband, we found the single-wavelength method provided the worst center wavelength accuracy but achieved the smallest in-band ripple.
**The dual-wavelength out-performed in center-wavelength accuracy but produced a significant in-band ripple.
**The TTP method was the most stable optimization process, but resulted in poor filter shape and in-band ripple.
**However, as the optimization results depend on the initial filter spectrum and the degree of ripples in the filter passband, the TTP method may provide better performance for other filter designs.
**We also note that none of the objective functions investigated directly sought to minimize in-band ripple.
**Such an objective function may be better suited for optimizing Chebyshev filters with inherent in-band ripples.
**In future experiments, we would also like to explore the dependence of the order of optimization for high-order devices and the effectiveness of weighted objective functions.
**Further research could develop an algorithm that learns what parameters to optimize for a given measured spectrum.
**Initially, heating changes would be made to increase the peak transmission.
**After some threshold value is approached, in-band ripple and center wavelength offset could become more important, and so the objective function would be more strongly influenced by these characteristics.