Fast photonic information processing using ... - OSA Publishing

Fast photonic information processing using semiconductor lasers with delayed optical feedback: Role of phase dynamics Romain Modeste Nguimdo,∗ Guy Verschaffelt, Jan Danckaert, and Guy Van der Sande Applied Physics Research Group, Vrije Universiteit Brussel, 1050 Brussels Belgium ∗ [email protected]

Abstract: Semiconductor lasers subject to delayed optical feedback have recently shown great potential in solving computationally hard tasks. By optically implementing a neuro-inspired computational scheme, called reservoir computing, based on the transient response to optical data injection, high processing speeds have been demonstrated. While previous efforts have focused on signal bandwidths limited by the semiconductor laser’s relaxation oscillation frequency, we demonstrate numerically that the much faster phase response makes significantly higher processing speeds attainable. Moreover, this also leads to shorter external cavity lengths facilitating future on-chip implementations. We numerically benchmark our system on a chaotic time-series prediction task considering two different feedback configurations. The results show that a prediction error below 4% can be obtained when the data is processed at 0.25 GSamples/s. In addition, our insight into the phase dynamics of optical injection in a semiconductor laser also provides a clear understanding of the system performance at different pump current levels, even below solitary laser threshold. Considering spontaneous emission noise and noise in the readout layer, we obtain good prediction performance at fast processing speeds for realistic values of the noise strength. © 2014 Optical Society of America OCIS codes: (140.5960) Semiconductor lasers ; (190.3100) Instabilities and chaos; (200.3050) Information processing; (250.4745) Optical processing devices.

References and links 1. J. P. Crutchfield, L. D. William, and S. Sudeshna, “Introduction to focus issue:intrinsic and designed computation: information processing in dynamical systems-beyond the digital hegemony,” Chaos 20, 037101–037107 (2010). 2. D. Woods and T. J. Naughton, “Optical computing: photonic neural networks,” Nat. Phys. 8 257–259 (2012). 3. W. Maass, T. Natschl¨ager, H. Markram, “Real-time computing without stable states: a new framework for neural computation based on perturbations,” Neural Comput. 14, 2531–2560 (2002). 4. H. Jaeger and H. Haas, “Harnessing nonlinearity: predicting chaotic systems and saving energy in wireless communication,” Science 304, 78–80 (2004). 5. D. Verstraeten, B. Schrauwen, M. D’Haene, and D. Stroobandt, “An experimental unification of reservoir computing methods,” Neural Networks 20, 391–403 (2007). 6. J. J. Steil, “Backpropagation-decorrelation: Online recurrent learning with O(N) complexity,” In Proceedings of IJCNN ’04’ 1, 843–848 (2004). 7. H. J. Caulfield and S. Dolev, “Why future supercomputing requires optics,” Nat. Photon. 4, 261–263 (2010).

#206149 - $15.00 USD (C) 2014 OSA

Received 7 Feb 2014; revised 24 Mar 2014; accepted 24 Mar 2014; published 3 Apr 2014 7 April 2014 | Vol. 22, No. 7 | DOI:10.1364/OE.22.008672 | OPTICS EXPRESS 8672

8. K. Vandoorne, W. Dierckx, B. Schrauwen, D. Verstraeten, R. Baets, P. Bienstman, and J. Campenhout, “Towards optical signal processing using photonic reservoir computing,” Opt. Express 16, 11182–11192 (2008). 9. L. Appeltant, M. C. Soriano, G. Van der Sande, J. Danckaert, S. Massar, J. Dambre, B. Schrauwen, C. R. Mirasso, and I. Fischer, “Information processing using a single dynamical node as complex system,” Nat. Commun. 2, 468–472 (2011). 10. L. Larger, M. C. Soriano, D. Brunner, L. Appeltant, J. M. Gutierrez, L. Pesquera, C. R. Mirasso, and I. Fischer, “Photonic information processing beyond Turing: an optoelectronic implementation of reservoir computing,” Opt. Express 20, 3241–3249 (2012). 11. Y. Paquot, F. Duport, A. Smerieri, J. Dambre, B. Schrauwen, M. Haelterman, and S. Massar, “Optoelectronic reservoir computing,” Sci. Rep. 2, 287 (2012). 12. R. Martinenghi, S. Rybalko, M. Jacquot, Y. K. Chembo, and L. Larger, “Photonic nonlinear transient computing with multiple-delay wavelength dynamics,” Phys. Rev Lett. 108, 244101 (2012). 13. F. Duport, B. Schneider, A. Smerieri, M. Haelterman, and Serge Massar, “All Optical Reservoir Computing,” Optics Express 20, 22783–22795 (2012). 14. A. Smerieri, F. Duport, M. Haelterman, and S. Massar, “Analog readout for optical reservoir computers,” in Advances in Neural Information Processing Systems, Vol. 25, P. Bartlett, F. C. N. Pereira, C. J. C. Burges, L. Bottou and K.Q. Weinberger, eds. (MIT Press, 2012), pp. 953–961. 15. D. Brunner, M. C. Soriano, C. R. Mirasso, and I. Fischer, “Parallel photonic information processing at gigabyte per second data rates using transient states,” Nature Commun. 4, 1364 (2013). 16. K. Hicke, M. A. Escalona-Moran, D. Brunner, M. C. Soriano, I. Fischer, and C. R. Mirasso, “Information processing using transient dynamics of semiconductor lasers subject to delayed feedback,” IEEE J. Sel. Top. Quantum Electron. 19, 1501610 (2013). 17. R. Lang and K. Kobayashi, “External Optical Feedback Effects on Semiconductor Injection Laser Properties,” IEEE J. Quantum Electron. 16, 347–355 (1980) 18. T. Heil, A. Uchida, P. Davis, and T. Aida, “TE-TM dynamics in a semiconductor laser subject to polarizationrotated feedback,” Phys. Rev. A 68, 033811 (2003). 19. M. C. Soriano, J. Garc´ıa-Ojalvo, C. R. Mirasso, and I. Fischer, “Complex photonics: dynamics and applications of delay-coupled semiconductors lasers,” Rev. Mod. Phys. 85, 421–470 (2013). 20. R. M. Nguimdo, G. Verschaffelt, J. Danckaert, X. Leijtens, J. Bolk, and G. Van der Sande, “Fast random bit generation based on a single chaotic semiconductor ring laser,” Opt. Express 20, 28603–28613 (2012) 21. S. Xiang, W. Pan, B. Luo, L. S. Yan, X. H. Zou, N. Jiang, L. Yang, and H. N. Zhu, “Unpredictability-enhanced chaotic vertical-cavity surface-emitting lasers with variable-polarization optical feedback,” J. Lightw. Technol.29 2173–2179 (2011). 22. R. M. Nguimdo, M. C. Soriano, and P. Colet, “Role of the phase in the identification of delay time in semiconductor lasers with optical feedback,” Opt. Lett. 36, 4332–4334 (2011). 23. D. Rontani, A. Locquet, M. Sciamanna, D. S. Citrin, and S. Ortin, “Time-Delay Identification in a Chaotic Semiconductor Laser With Optical Feedback: A Dynamical Point of View,” IEEE J. Quantum Electron. 45, 879–891 (2009). 24. R. M. Nguimdo, G. Verschaffelt, J. Danckaert, and G. Van der Sande, “Loss of time-delay signature in chaotic semiconductor ring lasers,” Opt. Lett. 37, 2541–2544 (2012). 25. S. Wieczorek, B. Krauskopf, T. B. Simpson, and D. Lenstra, “The dynamical complexity of optically injected semiconductor lasers,” Phys. Rep. 416 1 (2005). 26. A. S. Weigend and N. A. Gershenfeld, “Time series prediction: Forecasting the future and understanding the past,” ftp://ftp.santafe.edu/pub/Time-Series/Competition (1993). 27. M. C. Soriano, S. Ort´ın, D. Brunner, L. Larger, C. R. Mirasso, I. Fischer, and L. Pesquera, “Optoelectronic reservoir computing: Tackling noise-induced performance degradation,” Opt. Express 21, 12–20 (2013). 28. L. Appeltant, G. Van der Sande, J. Danckaert, and I. Fischer, ”Constructing optimized binary masks for reservoir computing with delay systems,” Sci. Rep. 4, 3629 (2014). 29. M. C. Soriano, S. Ort´ın, L. Keuninckx, L. Appeltant, J. Danckaert, L. Pesquera, and G. Van der Sande, ”Delaybased Reservoir Computing: noise effects in a combined analog and digital implementation,” accepted to IEEE Trans. Neural Netw. Learn. Syst. (2014).

1.

Introduction

Traditional computers can be inefficient when trying to solve highly complex computational tasks such as speech recognition or facial recognition. Novel computational techniques are therefore highly desired [1, 2]. New brain-inspired information processing methods based on artificial neural networks have shown great potential in solving computationally hard tasks such as pattern recognition, time series prediction and classification at which the brain typically excels [3–6]. In such neural networks, a given task can be performed by first appropriately ad#206149 - $15.00 USD (C) 2014 OSA


Input(t)

Θ

M (t)

(a)

S(t) = M (t) ∗Input(t)

(b)

(c)

node separation

t

t 0

TD

2TD

3TD

0

TD

t 0

TD

2TD

3TD

Fig. 1. Illustration of the masking procedure. (a) The discrete input data. (b) The temporal mask constructed for N = 3 and with a node separation of θ . (c) The full preprocessed signal to be injected into the reservoir computer.

justing the strengths of the network connections, which is done through learning by example or in a training procedure. Training such a recurrent network is a highly nonlinear problem and requires a large amount of computational power. This problem is avoided in reservoir computing (RC) in which an artificial neural network is split into three separate layers: the input layer, the reservoir layer and the output layer, with the reservoir layer typically being a large recurrent network. The output layer is explicitly separated from the rest of the network and only the connections from the reservoir to the output layer are trained. As a result, a linear training algorithm can suffice. The computational power of the RC concept lies in the complex nonlinear transient response to an input signal of the very-high dimensional nonlinear system, that is the reservoir. Photonics, besides its application for super-computation [7], has been identified as a highly suitable technology for enabling the implementations of networks suited for RC [8]. The training of the system is performed in the output layer alone. Therefore this training does not alter the dynamical behavior of the reservoir itself. This means that the exact implementation of the reservoir is not constrained to a network of a large number (102 − 103 ) of nodes and any high-dimensional nonlinear system could be a suitable candidate for RC. Recently, it has been demonstrated that the RC architecture can be drastically simplified by relying on a single dynamical nonlinear node subject to delayed-feedback [9]. Delay systems are very attractive from an implementation point of view as only very few components are required to build them. This breakthrough has therefore paved the way for photonic implementations based on electrooptics [10–14]. Also, and this is the focus of this paper, an all-optical RC scheme based on a semiconductor laser (SL) with delayed optical feedback and using optical data injection has been shown to achieve state-of-the-art computational performances while operating at high bit rates [15, 16]. To ensure that the optical input signal can effectively access the full dimensionality of the delay-based reservoirs and that the system remains in a transient regime, a pre-processing procedure relying on the use of a temporal mask is required [9, 11]. This masking procedure is illustrated in Fig. 1. The input data is always discrete with a sampling time matching the delay time TD [see Fig. 1(a)]. A temporal mask M(t) is defined in Fig. 1(b), which is a piecewise constant function that is only non-zero over a temporal interval of the same length as the delay time TD (i.e. t ∈ [0, TD [). This interval is divided into N sub-intervals of length Θ, referred to as the node separation during which the mask is kept constant. The constant value of the temporal mask within one node separation is randomly drawn from a pre-defined set of the suitable values. The full input signal is then constructed by convoluting the discrete time series of the input data with this newly defined temporal mask [see Fig. 1(c)]. As a consequence, the node separation Θ defines the positions of so-called virtual nodes along the delay line. When one input sample (of length TD ) has been completely injected into the system, the delay line is tapped at the virtual nodes and the local intensities recorded. A linear combination of these virtual nodes will constitute the output of the system. The goal is that this output matches the desired target response of the RC. This can be achieved by performing a training procedure on

#206149 - $15.00 USD (C) 2014 OSA


the linear weights. It is clear that N and Θ which define the delay length TD = NΘ, and the mask properties limit the processing speed. N = 50 − 400 is typically sufficient and defined by the task at hand. The node separation Θ should be chosen not too large ensuring that the system is permanently maintained in a transient regime and not too small as the system would filter out the input data. The importance of the ratio between Θ and the characteristic timescale of the nonlinear node was pointed out before [9]. Most often, the optimized value of Θ is somewhat smaller than the intrinsic time scale of the nonlinear node. Given the relatively slow time scales of optoelectronic RC systems discussed in the literature, delay times of about 20μ s (i.e Θ ∼ 10 ns) have been used [10–14]. Shorter delay times of ≈ 80 ns have been also used in alloptical RC systems based on SLs [15, 16]. In all these schemes, the delay time is implemented experimentally using an optical fiber [10, 11] or with electronic delay lines based on a first-in first-out memory [9, 12]. In photonics, long delay lengths will limit the processing speed. In addition, they are not suitable for potential on-chip implementations of RC schemes as on-chip waveguide lengths are limited by absorption and chip real estate. Concerning semiconductor lasers with delayed optical feedback, previous works [15, 16] have targeted a node separation Θ ≈ 200 ps. This separation was determined from the relaxation oscillation (RO) period. In this contribution, we numerically demonstrate that in exactly the same system it is possible to achieve 10 times faster processing speeds with a 10 times shorter overall delay length as compared to [15, 16]. In particular we show that, thanks to the combined effect of optical feedback and injection, the phase dynamics which is much faster than the RO dynamics is also suitable for processing. We also demonstrate that one is free to define the node separation Θ in a broad band ranging from the fastest time scale of the system (i.e photon lifetime) to the intensity relaxation time without degrading computing power. Furthermore, we show that semiconductor lasers are suited for RC in a wide range of operation points. Considering spontaneous emission noise and noise in the readout layer, we obtain good prediction performance at fast processing speeds for realistic values of the noise strength. 2.

Model

We consider a quantum well SL operating in a single-longitudinal mode with delayed optical feedback. We extend this setup to include a Mach-Zehnder modulator (MZM) seeded by a contineous-wave laser. The MZM will be used to inject the data optically. This ensemble constitutes the reservoir for our RC scheme. It is modeled by the so-called Lang-Kobayashi equations [17,18,27] extended to include optical injection. We describe the reservoir’s dynamical behavior in terms of the mean-field slowly varying complex electric field amplitudes of both the parallel (E1 = |E1 |eiϕ1 ) and the perpendicular (E2 = |E2 |eiϕ2 ) polarization direction, and the carrier number N: 1 E˙1 = (1 + iα ) [G1 − γ1 ] E1 + η1 E1 (t − TD )e−iΩ0 TD + ξ1 (t) + kin j Ein j (t), 2 1 E˙2 = −iΔΩE2 + (1 + iα ) [G2 − γ2 ] E2 + η2 E1 (t − TD )e−iΩ0 TD + ξ2 (t), 2 I 0 N˙ = − γe N − G1 |E1 |2 − G2 |E2 |2 , e

(1) (2) (3)

where G1,2 = gm1,2 (N − N0 )/(1 + ε |E1,2 |2 ) stands for the optical gain, ε being the saturation factor. The parameters are the linewidth enhancement factor α , the pump current I0 , photon decay rates γ1,2 , electron decay rate γe , detuning ΔΩ between E1 and E2 , carrier number at transparency N0 , differential gains gm1,2 , loop delay time TD = NΘ, feedback strengths η1,2 and injection strength kin j . Ω0 is the solitary laser angular frequency. ξ1,2 are complex Gaussian #206149 - $15.00 USD (C) 2014 OSA


Table 1. Values used for numerical simulations

parameters α -factor Photon decay rates Electron decay rate Detuning between E1 and E2 Detuning E and Ein j Threshold of pump current Carrier number at ransparancy Differential gains Gain saturation coefficient Feedback rates Spontaneous emission factors Constant feedback phase injection rate Pump current Number of nodes Node separation Loop delay time Amplitude of the injected field Dimensionless bias voltage of the MZM

Designation α γ1,2 γe ΔΩ Δω Ith N0 gm1 gm2 ε η1 or η2 β1,2 Ω0 TD kin j I0 N Θ TD |E0 | Φ0

value 3.0 200 ns−1 1.5 ns−1 0.0 0.0 9 mA 1.8 × 107 10−5 ns−1 8.4 × 10−6 ns−1 10−7 see figure captions 10−6 0.0 50 ns−1 see figure captions 200 see figure captions TD = NΘ 200 π /4

white noise terms with zero mean and ξi (t)ξi∗ (t ) = 2βi γe N δ (t − t ) where i = 1, 2. The last term kin j Ein j (t) in Eq. (1) represents the optical injection from the MZM. The optical gains of the two modes are usually not equal. We assume that g1 > g2 such that the signal in E1 is dominant in the solitary laser. Note that each polarization mode can be subjected to a feedback either from the node itself [polarization maintained optical feedback (PMOF)] or from the other mode [polarization rotated optical feedback (PROF)]. In this contribution, we neglect the feedback from E2 as in ref. [16]. The feedback in only one mode can be experimentally implemented using linear polarizers and Faraday rotators for PMOF and PROF configurations, respectively. These two configurations are implemented in the model through the parameters η1 and η2 : η2 = 0 for PMOF configuration and η1 = 0 for PROF configuration. In practice, the data can be added optically to the nonlinear node via a MZM [13, 15, 16]. In this case, the input data convoluted with the mask is used to modulate the optical signal through the rf electrode of the MZM. The output of the MZM, i.e Ein j (t) is subsequently fed into the dominant polarization direction E1 . Then Ein j (t) can be written as |E0 | 1 + ei[S(t)+Φ0 ] eiΔω t , Ein j (t) = (4) 2 where Δω is the detuning between E1 and Ein j , |E0 | is the field amplitude of the injection. S(t) represent the normalized input data fed in the rf electrode, while Φ0 is the bias voltage of the MZM. Typically, S(t) results from the input signals after the pre-processing procedures by convoluting the input data with the mask M(t) as mentioned in the introduction. Thus the temporal structure of S(t) depends on Θ (see Fig. 1). In order to identify suitably Θ, it is necessary to first identify the time scales which influence the transient dynamics of the system. #206149 - $15.00 USD (C) 2014 OSA


3.

Characterization of the dynamical behavior of SL with delayed feedback

This section aims at providing some features regarding the dynamics of the SL with delayed feedback in the absence of any injection, i.e kin j = 0 ns−1 . Other parameters considered are as stated in Table 1 and in the figure captions [16]. With these parameters, the threshold of the pump current is Ith = 9 mA. The numerical results are obtained by integrating the rate equations using the 2nd -order Runge-Kutta method for stochastic equations with an integration step of 2 ps. We perform a pre-integration over a period of 0.5μ s (which is much longer than the longest time scale of the model) to account for transients. For kin j = 0 ns−1 and η1 = 20 ns−1 , the PMOF is chaotic in the whole range of pump currents we explored (i.e I0 ≤ 1.5Ith ) while the PROF configuration remains stable in the same range of the pump current. Several time scales such as the relaxation oscillation period τRO and the delay time TD can play a role in the dynamical behavior of a laser with feedback. If a time scale influences the intrinsic system dynamics, its signature can manifest itself in the intensity or in the phase dynamics. The relevant time scales can be typically revealed through the computation of statistical quantifiers for time scale identification such as the delayed mutual information, autocorrelation function and spectrum [20–24]. In order to identify the time scales present in the system, we consider a chaotic regime because in this regime, different time scales will be present in a single time trace.

-20

Intensity Relaxation (a) frequency Phase Relaxation frequency

-30 -40 0

from phase

from power

5

10 15 20 25 Frequency [GHz]

30

Autocorrelation

Spectra [dB]

-10

0.8

n

io

t xa

(b)

a

0.4

l Re e s a d Ph rio pe

Intensity Relaxation period

0

-0.4 0

0.1 0.2 0.3 0.4 0.5 0.6 Lag time [ns]

Fig. 2. (a) Spectra computed from intensity |E1 (t)|2 (black) and from phase ϕ1 (t) (gray, blue); (b) autocorrelation. The parameters are I0 = 1.3Ith , TD = 4 ns, β1,2 = 10−6 , η1 = 20 ns−1 , kin j = 0 ns−1 and η2 = 0 ns−1 (i.e PMOF configuration)

Figure 2(a) displays the results for the spectra as computed from intensity time series |E1 (t)|2 (black) and from the phase ϕ1 (t) recovered within the interval [−π , π ] (gray, blue) for I0 = 1.3Ith and TD = 4 ns considering the PMOF configuration. The power spectrum (black) reveals the relaxation period τR0 of the free-running SL through a clear peak located at the inverse of τR0 ≈ 0.48 ns as expected (see refs. [22, 24]). However, the phase spectrum (gray, blue) shows that the phase relaxes at a different time scale which is much faster than τR0 attached to the intensity. The phase spectrum is considerably broader with a peak at ≈ 18 GHz. The intensity relaxation period and the phase response time, i.e, τR0 and Tphase , respectively are further quantified in Fig. 2(b) through the computation of the autocorrelation from the intensity and the phase dynamics, respectively. Damped oscillations are seen in both cases, however, with a much shorter period for the phase. A damped oscillation period Tphase approx0.055 ns corresponding to the phase response frequency of 18 GHz can be identified in Fig. 2(b). In Fig. 3, we decrease I0 to just under the threshold and compute again the intensity and the

#206149 - $15.00 USD (C) 2014 OSA


(a)

from phase

-40 -60 -80 0

Phase Relaxation frequency

from

pow

er

5

0.2 Autocorrelation

Spectra [dB]

-20

Phase Relaxation period

0.1

(b)

0

-0.1

10 15 20 25 Frequency [GHz]

0

30

0.05 0.1 0.15 Lag time [ns]

0.2

Fig. 3. Same as in Fig. 2 for I0 = 0.9Ith .

(a)

~20 GHZ ~30 GHZ

Autocorrelation

Phase spectra [dB]

-20 -30 -40 -50 0

η1=30 ns-1 -1 η1=20 ns 10 20 30 40 Frequency [GHz]

50

1 0.8 (b) 0.4

-1

η1=30 ns-1 η1=20 ns

s 3n ns 05 ~0.

3 0.0

~

0

-0.4 0

0.04 0.08 Lag time [ns]

0.12

Fig. 4. (a) Phase spectra computed from phase ϕ1 (t) for η1 = 20 ns−1 (gray, blue in color) and η1 = 30 ns−1 (black); (b)corresponding autocorrelation function computed from the same ϕ1 (t). Other parameters are in Fig. 2.

phase spectra [Fig. 3(a)], as well as the autocorrelation from the phase dynamics [Fig. 3(b)]. τRO vanishes as expected while the amplitude of the intensity signal becomes very small. This small amplitude existing below, but near the threshold current is caused by the feedback. It is therefore expected to increase when increasing the feedback. While τRO ceases to exist below the threshold, the phase still relaxes almost at the same frequency as that observed for I0 > Ith , i.e ≈ 18 GHz [Fig 3(a)]. This relaxation is also confirmed from the autocorrelation which shows damped oscillations with a period ≈ 0.055 ns [Fig. 3(b)]. These results show that the phase response seems to be independent of the pump current and thus of τRO . However, it is related to the frequency of the laser’s external cavity modes which strongly depend on the feedback strength η1 . The nonlinear feedback terms sin [ϕ1 (t − TD ) − ϕ1 (t)] in the phase equation can indeed induce a change in frequency and create oscillations related to the external cavity modes, which are feedback strength (but not pump current) dependent. For strong feedback, the phase −1 ≈ η . By way relaxes at the frequency which is the closest to the feedback strength, i.e Tphase of illustration, Fig. 4 shows the phase response dependence on the feedback strength for η1 = 20 ns−1 (gray, blue) and η1 = 30 ns−1 (black). As can be seen, the peak in the phase spectrum is situated at ≈ 20 Ghz for η1 = 20 ns−1 and at ≈ 30 Ghz for η1 = 30 ns−1 . These results are further confirmed in the autocorrelation shown in Fig. 4(b) which, in each case, shows damped oscillations at a period corresponding to the inverse of the feedback strength. In the next section, we investigate how optical injection influences the dynamics of the reservoir. #206149 - $15.00 USD (C) 2014 OSA


4.

Characterization of the dynamical behavior of the reservoir without the input data

As pointed out in ref. [25], the external injection into a SL with delayed optical feedback can influence its dynamical regime. In the RC scheme presented here, a bias injection |E0 | 1 + eiΦ0 eiΔω t /2 remains even in the absence of the input data [S(t) = 0] when kin j = 0 and therefore this bias injection needs to be considered as a part of the reservoir. For kin j = 50 ns−1 (as will be considered later), the PROF configuration which was stable without the injection becomes unstable above I0 ≈ 1.4Ith for S(t) = 0. On the other hand, the PMOF which was unstable becomes stable up to I0 ≈ 1.15Ith due to the injection for S(t) = 0. This means that, in some instances, the bias injection stabilizes the SL’s output [25] while in other instances, it destabilizes the output. We focus here only on the parameter region for which the output is stable because it will be the most suitable regime for reservoir computing. To investigate the time scales in this regime, we compute again the autocorrelation function from the intensity time series considering kin j = 50 ns−1 . Note that the intrinsic noise, present in Eqs. (1)-(3) induces small excursions around the stable output in the time trace.

I0=1.1Ith I0=0.9Ith

Autocorrelation

0.8 0.4 0

-0.4 -0.8 0

0.1 0.2 0.3 0.4 0.5 0.6 Lag time [ns]

Fig. 5. Autocorrelation function showing the (a) intensity relaxation and (b) the carrier relaxation considering η1 = 10 ns−1 for the PMOF configuration.

In Fig. 5, we show for the PMOF configuration the results of the autocorrelation computed from the intensity time series for a value just under threshold (dashed) and for a value just above threshold (solid line). Below the threshold, the intensity relaxation consists of oscillations with period ≈ 0.075 ns and which are completely damped after a lag time < 0.2 ns [Fig. 5(a), dashed line]. Above the threshold, the oscillation period in the autocorrelation is ≈ 0.15 ns. It is clear that this value is still far away τRO ≈ 0.83 ns. In addition, the damped oscillations span over a time > 0.6 ns [Fig. 5(a), solid line]. In all cases studied, we found that the intensity and the phase (not shown here) relax at the same frequency in the presence of the bias injection. The stable output regime corresponds to a locking of the SL with delayed optical feedback to the injected signal. The locking relaxation frequency, which will define the damped oscillations observed in Fig. 5(a), is related to the detuning between the frequency of the injected signal and the frequency of the system without injection. Without injection (see Section 3) the laser will try to operate in a frequency range defined by the external cavity modes. Therefore, the locking oscillation frequency and consequently the oscillatory response of intensity and phase, can be fast and will be defined by the phase response time observed in Section 3. To gain further insight, we show in Fig. 5(b) the autocorrelation computed from the carrier number considering I0 = 0.9Ith (dashed line) and I0 = 1.1Ith (solid line). It turns out in both cases that the correlation decays to zero after ≈ 2γe−1 for I0 = 0.9Ith and ≈ 2τRO for I0 = 1.1Ith . This proves that, above threshold, the normal relaxation oscillations of the solitary laser, have #206149 - $15.00 USD (C) 2014 OSA


been replaced by these locking oscillations, but that the damping of these locking oscillations is still driven by the slower carrier dynamics. In the next sections, we investigate the suitability of these time scales for RC. 5.

Performance of delay-based RC using SLs

The above results have illustrated two main features of the reservoir dynamics which may be useful for RC: the phase and the intensity relax at a fast time scale in the presence of the bias injection and the transient time spans over several oscillation periods when I0 > Ith . This suggests that the system can successfully perform RC tasks at different values of Θ, which we will analyze now. Typical benchmark tasks to test the RC performance are Signal Classification, Nonlinear Channel Equalization, Isolated Spoken Digit Recognition and Santa-Fe Time Series Prediction [8,10–12]. The latter is particularly challenging because it requires both nonlinearity and memory suggesting that feedback plays a role. Therefore, we use this task throughout the rest of the paper in order to quantify the RC performance. The Santa-Fe data used are intensity time series recorded from a real far-infrared laser operating in a chaotic state [26]. Our Santa-Fe data set contains 10000 points and we use the first 4000 points (first 3000 points for training and the next 1000 for testing). The target is to predict the next sample in the chaotic time trace before it has been injected into the reservoir computer (one-step ahead prediction). The performance on this task is typically evaluated based on the normalized mean square error (NMSE) defined as 2 y(n) − ytarget (n) , (5) NMSE(y, ytarget ) =

2 ytarget (n) − ytarget (n) where y is the predicted value while ytarget is the expected value, n is a discrete time index, . and . stand for the norm and the average, respectively. Typically, the system is considered to be performing well when the NMSE stays below 10%. We consider a reservoir with N = TD /Θ = 200 virtual nodes and a random mask with four discrete levels (0, 0.25, 0.75, 1) [27]. Procedures to construct masks for optimal performance are not considered here, because they are limited to two-valued masks [28]. The number of Intensity response

Input data

(a)

1

Phase response

(b)

5

10

Node Number

15

1

5

10

Node Number

15

Fig. 6. Temporal profiles of the data input at the rf electrode of the MZM (back) and the SL response in the intensity (red) and in the phase (blue) for (a) Θ = 200 ps and (b) Θ = 20 ps considering η1 = 10 ns−1 (PMOF configuration), β1,2 = 10−6 and I0 = 1.1Ith .

#206149 - $15.00 USD (C) 2014 OSA


nodes N is kept constant throughout the rest of this paper, meaning that any change in Θ will be accompanied by a change in the delay length TD and in the temporal structure of S(t). The mask is used to preprocess the Santa-Fe data and the resulting signal is rescaled so that 0 ≤ S(t) ≤ π /2. The data is injected in the rf input of the MZM after the reservoir has reached its steady state, i.e after the transient time.

NMSE

0.8 0.6

(a) PMOF

Θ=20 ps Θ=200 ps

(b) PROF

Θ=20 ps Θ=200 ps

0.4 0.2 0 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 I0/Ith I0/Ith

Fig. 7. (a) NMSE for PMOF considering η1 = 10 ns−1 , (b) PROF considering η2 = 10 ns−1 when the data are input and read out in the dominant mode which provides feedback to the other mode. The injection strength considered for both cases is kin j = 50 ns−1 .

Figure 6 displays the temporal profiles of the reservoir response both in the intensity (red) and the phase (blue), as well as the data input at the rf electrode of the MZM (black) for both Θ = 200 ps (a) and Θ = 20 ps (b). We can now relate the results from Fig. 5 to the transient response of the reservoir to the masked input. For longer node distances [as in Fig. 6(a)], each time the input signal (black) jumps to a new level, the laser responds initially fast both in intensity and phase. This oscillatory response is then slowly damped as predicted by Fig. 5. As explained in Ref. [9], to achieve a good computational performance, it is essential that the transient response to an input level is not completely damped before the system is subjected to the next new input level. As a result, Θ = 200 ps, can be considered as an adequate choice for the node distance. Again according to Ref. [9], the transient response also needs time to develop to measurable levels. In other words, the node distance should not be too small. As seen in Fig. 6(a), the node distance can still be reduced due to the very fast transient response. In Fig. 6(b), for Θ = 20 ps, the data is injected at a speed TD−1 = 0.25 GSamples/s. Despite the high speed at which the data is injected into the reservoir, it can be seen that both the intensity and the phase respond well to the external stimulus. For very small node distances Θ, an appreciable transient response can be observed and this is mainly due to the very fast (phase-driven) locking relaxation, while for longer node distances Θ the transient response is still measurable thanks to the much slower carrier relaxation. Therefore, we can expect good computational performance in a broad range of Θ spanning at least one order of magnitude. In order to identify the most suitable parameter regimes for which our system can successfully predict the chaotic input signal one-time step ahead in the future, we show in Fig. 7 the performance expressed by the NMSE as a function of the pump current I0 for two different values of Θ: one is close to 2Tphase /3 and another is close to 2τRO /3 (i.e Θ = 20 ps and Θ = 200 ps) both considering the PMOF configuration for η1 = 10 ns−1 [Fig. 7(a)] and PROF configuration for η2 = 10 ns−1 [Fig. 7(b)]. For Θ = 200 ps () which was experimentally used #206149 - $15.00 USD (C) 2014 OSA


in refs [15,16] for the same system as studied here, we find a good agreement with experimental results. In particular, as experimentally shown in Fig. 3(a) in ref. [15] and Fig. 11 in ref. [16], a good performance (characterized by a small NMSE) is obtained close to the threshold current for the PMOF configuration (). This performance rapidly degrades for pump currents well above the threshold. Remarkably, when decreasing Θ to a small value of Θ = 20 ps, similar results are obtained(•) although this value is far below the RO period and has therefore not been considered in previous studies on the topic. It can even be seen that these results are consistently better than or equal to those obtained for Θ = 200 ps. For the PROF configuration, it can be seen in Fig. 7(b) that the system has a good performance over a broader range of pump current values as compared to the PMOF configuration. The good performance in a large range of the pump current is because, in the absence of the input data, the PROF configuration is stable for these pump currents. It is worth noting that in the absence of input (S(t) = 0), the external bias injection, i.e |E0 | 1 + eiΦ0 eiΔω t /2 favors the emergence of steady state emission for parameter values for which acceptable NMSE values have been found for PMOF configuration. η=1 ns-1

1 NMSE

0.8

(a) PMOF

η=10 ns

-1

η=20 ns

-1

η=30 ns

-1

(b) PROF

0.6 0.4 0.2 0 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 I0/Ith I0/Ith

Fig. 8. NMSE for SL with (a) PMOF and (b) PROF considering Θ = 20 ps for different values of feedback strengths η = η1 for PMOF or η = η2 for PROF. The parameters are TD = NΘ, N=200 and kin j = 50 ns−1 .

Considering Θ = 20 ps we further explore, in both configurations, the optimized parameters by simulating the NMSE as a function of I0 for different values of the feedback strengths (Fig. 8). In particular for η1 = 1 ns−1 , the PMOF configuration becomes stable for a broad range of pump currents leading thus to acceptable NMSE in this range. However, the range of I0 for which small NMSE values are obtained below the threshold is reduced due to the fact that the amplitude completely vanishes for a noise-free system (or falls into the background noise when noise is taken into account). As the feedback strength is gradually increased, the region of small NMSE values is increasingly confined to small values of the pump current. For the PROF configuration, it is seen that the increase of the feedback strength tends to improve the system performance. The PROF configuration is, in fact, very stable so that it may be difficult for the system to discriminate between similar inputs belonging to different classes (separability property of RC). Such a stable state becomes easily perturbed as the feedback is gradually increased and therefore the system can discriminate between such inputs. As a consequence, the NMSE is improved. By way of illustration, the NMSE values for η2 = 30 ns−1 are slightly better than those obtained for η2 = 20 ns−1 . In particular, the NMSE is smaller than 0.03 for 1.2Ith ≤ I0 ≤ 1.4Ith , even with the realistic level of noise that is taken into account. We note that #206149 - $15.00 USD (C) 2014 OSA


we found a similar error considering a noise-free reservoir. For a comprehensive study of noise effects in delay-based reservoir computers, we refer the reader to section 6 and to Ref. [29]. Interestingly, NMSE≈ 0.03 obtained for I0 ≈ 1.35Ith and η1 = 1 ns−1 (i.e PMOF configuration) is of great practical importance for on-chip implementations: the above threshold pump current values lead to higher output powers which are easy to detect; PMOF does not need any polarization rotation; the low feedback levels can be obtained without a strong amplification even when there is a lot of absorption. We would also like to mention that similar results as those shown in Fig. 8 have been obtained for other values of Δω and ΔΩ while the other parameters are kept unchanged.

0.5

NMSE

0.4

PMOF (a)

I0=1.1Ith I0=0.9Ith

PROF (b) I0=1.4Ith I0=0.9Ith

0.3 0.2 0.1 0 0

50

100 150 200 250 0 Θ [ps]

50

100 150 200 250 Θ [ps]

Fig. 9. NMSE as a function of Θ for a SL with (a) PMOF considering η1 = 10 ns−1 and (b) PROF considering η2 = 30 ns−1 .

For further insight, we next consider the parameter sets for which the best performance is obtained in Fig. 8 for each configuration and we investigate the dependence of the reservoir performance on Θ. Figure 9 shows the NMSE values for η1 = 10 ns−1 and I0 = 1.1Ith for the PMOF configuration (a), and η2 = 30 ns−1 and I0 = 1.4Ith for the PROF configuration (b). In both configurations it can be seen that, similar as for the phase dynamics of Figs. 2 and 3, the NMSE does not significantly change with Θ when the system operates above the threshold current provided that the fixed point is stable for S(t) = 0 (no input data). This good performance can also be explained by the fact that the reservoir is in the transient state for all the values of Θ we explored as shown in Fig. 5(a). The node separation Θ can be therefore freely −1 ) and the RO period τRO chosen between the fastest intrinsic time scale of the model (i.e γ1,2 without significantly degrading the performance when the system operates above the threshold. However for I0 < Ith , the RC performance does depend on the value of Θ. In particular for the PMOF configuration, NMSE< 0.1 is obtained only for Θ near ≈ 2Tphase /3 and ≈ 2τRO /3. This further evidences that all the coexisting intrinsic time scales play a role to maintain the system in a transient state useful for RC. From this point of view, the coexisting of several intrinsic time scales allows for a broad range of Θ values to be suitable for RC. For the PROF configuration, good performance is restricted around Θ ≈ 2Tphase /3 when I0 < Ith . 6.

Effects of noise

Reservoir computers based on physical systems are typically subjected to noise. The main noise contributions in SL-based RC schemes are the intrinsic noise from the spontaneous emission and the noise in the readout layer of the reservoir generated by the photodetectors or/and analogto digital converters. This section further addresses the influence of these noise sources on the

#206149 - $15.00 USD (C) 2014 OSA


RC performance of the system under study. 6.1.

Intrinsic noise to the laser

The level of the intrinsic noise in the reservoir as modeled in Eqs. (1) and (2) through the spontaneous emission factors β1,2 can typically differ from one system to another. Figure 10 illustrates the change in the system performance for different values of the SL spontaneous emission noise from the very low levels to the relatively high levels. More precisely, we show the influence of the reservoir’s intrinsic noise when the system operates just above threshold [Fig. 10(a)] and just under threshold [Fig. 10(b)].

0.8

(a) I0=1.1Ith

NMSE

PMOF 0.6 PROF

(b) I0=0.9Ith PMOF PROF

0.4 0.2 0 -10 -9 -8 -7 -6 -5 -4 -3 -10 -9 -8 -7 -6 -5 -4 -3 Log10(β1,2 )

Log10(β1,2 )

Fig. 10. NMSE as a function of the intrinsic noise strength for (a) I0 = 1.1Ith and (b) for I0 = 0.9Ith . Other parameters are as in Fig. 7

For I0 = 1.1Ith , it can be seen that the NMSE gradually increases for both configurations as the noise strength increases. It is also clear that noise degrades the performance of the RC. But for realistic levels of the noise (i.e 10−7 ≤ β1,2 ≤ 10−5 ), the increase of NMSE is not significant for the PMOF configuration (NMSE 5%). For the PROF configuration, however, we notice a profound increase in the NMSE from 5% to 20% when the noise strength β1,2 is scanned from 10−7 to 10−5 . Note that the PROF configuration can be further improved by using other values for the pump current and the feedback strength. For I0 = 0.9Ith , the role of the intrinsic noise is very different [Fig. 10(b)]. When noise is too low (β1,2 10−7 ), the reservoir completely fails to predict the next sample in the chaotic time trace. When 10−7 β1,2 10−5 , the intrinsic noise is rather beneficial as acceptable NMSE values are obtained. The noise in this range can be therefore viewed as equivalent to ridge regularization often used in numerical simulations to increase the robustness of the reservoir [15, 16]. Above a certain noise strength, the effect of noise becomes detrimental as found for I0 > Ith . 6.2.

Noise in the readout layer

In the previous sections, the simulations were done neglecting the noise in the readout layer in order to focus purely on the effects of the major design parameters of the reservoir. At high speeds, however, noise in the readout layer could become important as high speed photodetectors and/or analog-to-digital converters are usually more noisy. We model this noise by adding an extra noise term to the reservoir output signal such that it is read as |E1 (t)|2 + Dout ξout (t) instead of |E1 (t)|2 , where Dout is the noise amplitude while ξout (t) is a Gaussian white noise

#206149 - $15.00 USD (C) 2014 OSA


∗ (t ) = δ (t − t ). Here D with zero mean and correlation ξout (t)ξout out = 0 refers to noiseless photodetectors and/or analog-to-digital converters.

(a) PMOF 0.8

(b) PROF

without readout noise with readout noise

NMSE

0.6 0.4 0.2 0 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 I0/Ith Fig. 11. NMSE as a function of I0 /Ith without (•) and with () noise in the readout layer for (a) PMOF configuration with η1 = 10 ns−1 and (b) PROF configuration with η2 = 30 ns−1 . The noise strength in the readout layer is chosen such that it yiels to the signal-to-noise ratio of ≈ 20 dB at I0 = 1.1Ith for PMOF configuration. β1,2 = 10−6

To illustrate the detrimental effects of the noise in the readout layer, we evaluate again the system performance under the conditions of Fig. 7 and considering the value for Dout such that the signal-to-noise ratio (SNR) after the detection yields to ≈ 20 dB for I0 = 1.1Ith in the PMOF configuration. Here we use the variance of the output signal and that of the noise to determine the SNR. This value is the smallest pump current at which we find very good performance (NMSE 4%). We keep the strength Dout unchanged as the noise in the readout layer does not depend on the configuration nor on the pump current. In Figure 11, we show the NMSE for different values of the pump current taking into account noise in the readout layer. We take η1 = 10 ns−1 for PMOF configuration and η2 = 30 ns−1 for PROF configuration which are the feedback strengths for which the smallest NMSE is obtained for the PMOF and the PROF configurations, respectively (see Fig. 8). For comparison, we repeat the simulations for these feedback strengths and for Dout = 0 and show the results in the same plot. In both configurations, it is obvious that noise in the readout layer is very detrimental when the reservoir operates below threshold. This is because the amplitude of the signal is small below threshold and therefore the SNR becomes too small (SNR 2 dB). The effect of such noise is mitigated above the threshold because the SNR gradually increases as the pump current increases. As a result, the NMSE is very similar to that obtained for Dout = 0. 7.

Conclusions

We have studied the properties of a delay-based RC system with SLs. For the first time, we have investigated the parameter region where the node separation is much shorter than the relaxation oscillation period. Such short node separations are particularly interesting as they allow to increase the data processing speed. The results indicate that, thanks to the phase response which relaxes faster than the relaxation oscillations, good RC performance can be obtained even when the system is evaluated considering read out signals in the intensity |E1 (t)|2 . We also found that due to optical injection, which is an integral part of the setup, the intensity reacts at speeds related to the phase dynamics. As phase dynamics exists even below the threshold, good performance can be obtained below the threshold current provided that Θ is suitably chosen. More #206149 - $15.00 USD (C) 2014 OSA


precisely, the system can still respond to an external stimulus below the threshold through the phase when the intensity does not completely vanish. Due to different time scales which coexist in SLs, we found that any value of the node separation between the fastest time scale of the model and the RO period can be used above the threshold. This leads to an overall delay length between 1 ns and 420 ns for N = 200 virtual nodes. It is worth noting that small Θ values and, thus small delay lengths, are useful for compact on-chip implementations. Furthermore, a short delay also allows to increase the processing speed as the data is fed at the delay period. In addition, considering spontaneous emission noise and noise in the readout layer (i.e noise in detectors and analog-to-digital converters), we have found that good RC performance can be obtained at fast processing speeds for realistic values of the noise strength. As a final remark, we also note that our RC scheme is robust to both intrinsic noise and noise in the readout layer. These results are promising for implementing compact on-chip delay-based RC schemes with fast processing speeds. Acknowledgments The authors thank Dr. M. C. Soriano, Profs. S. Massar and I. Fischer for helpful discussions. The authors acknowledge the Research Foundation Flanders (FWO) for project support, the Research Council of the VUB and the Interuniversity Attraction Poles program of the Belgian Science Policy Office, under grant IAP P7-35 photonics@be.

#206149 - $15.00 USD (C) 2014 OSA