<table>
<thead>
<tr>
<th>Title</th>
<th>A Four-Context Optically Differential Reconfigurable Gate</th>
</tr>
</thead>
<tbody>
<tr>
<td>Author(s)</td>
<td>Watanabe, Minoru; Nakajima, Mao</td>
</tr>
<tr>
<td>Citation</td>
<td>Journal of Lightwave Technology. 27(20), p. 4460-4470</td>
</tr>
<tr>
<td>Issue Date</td>
<td>2009-10-15</td>
</tr>
<tr>
<td>URL</td>
<td><a href="http://hdl.handle.net/10297/3927">http://hdl.handle.net/10297/3927</a></td>
</tr>
<tr>
<td>Rights</td>
<td>(c) 2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE</td>
</tr>
</tbody>
</table>
Abstract—Reconfiguration applications based on reconfigurable devices present new computational paradigms since, by increasing the reconfiguration frequency of reconfigurable devices, their activity and performance can be improved dramatically. Recently, Optically Reconfigurable Gate Arrays (ORGAs) with a holographic memory were developed. They realize rapid reconfigurations and numerous reconfiguration contexts. Furthermore, Optically Differential Reconfigurable Gate Arrays (ODRGAs) have been developed to accelerate optical reconfigurations of conventional ORGAs. However, fast configuration experiments under multiple contexts exploiting the ODRGA architecture have never been reported. Therefore, this paper presents a four-context ODRGA system and experimental results demonstrating its fast reconfiguration. The advantage of the ODRGA architecture is discussed based on those results.

Index Terms—Field programmable gate arrays, holographic memories, optical data processing, programmable logic devices, semiconductor laser arrays.

I. INTRODUCTION

FIELD Programmable Gate Arrays (FPGAs) have been used widely in recent years because of their flexible reconfiguration capabilities [1]–[3]. Moreover, demand for high-speed reconfigurable devices has been increasing. If circuit information can be programmed rapidly from a memory to a gate array, idle circuits can be removed; only necessary circuits need be programmed into the gate array at the time, thereby increasing the gate-array activity. Moreover, such devices offer the possibility of providing a virtual gate count that is much larger than those of currently available VLSIs.

For those reasons, reconfiguration applications based on such reconfigurable devices can introduce new computational paradigms.

Nevertheless, because FPGA reconfiguration requires more than several milliseconds and because its gate array can not be used during reconfiguration, FPGAs are unsuitable devices for dynamic reconfiguration.

On the other hand, high-speed reconfigurable devices have been developed, e.g., DAP/DNA chips, DRP chips, and multi-context FPGAs [4]–[9]. Such devices package reconfiguration memories and a microprocessor array or a gate array onto a chip. The internal reconfiguration memory stores reconfiguration contexts of 4–16 banks, which can be changed from one to another during a clock cycle. Thereby, an arithmetic logic unit or a gate array of such devices can be reconfigured on every clock cycle of a few nanoseconds. However, increasing the amount of internal reconfiguration memory while simultaneously maintaining the gate density is extremely difficult because the gate density and the amount of internal reconfiguration memory present a tradeoff relation.

As another rapid configuration approach, optically programmable gate arrays (OPGAs) with a holographic memory have been developed [10]–[12]. In the OPGAs, many reconfiguration contexts can be stored in a holographic memory. Those contexts are subsequently read out by a laser array in a very short time. Therefore, the architecture allows both realization of fast reconfigurations and numerous reconfiguration contexts. However, the OPGA has an issue in the OPGA-VLSI part. The OPGA-VLSI consists of an optical receiver array and a gate array with configuration SRAM. The construction of a gate array with a configuration SRAM is the same as that of FPGAs. Because the configuration data are transferred serially between the optical receiver array and the configuration SRAM of the gate array, the reconfiguration period is limited to 16–20μs. Moreover, the architecture prohibits the use of the gate array during reconfiguration. Consequently, this device is also unsuitable for dynamic reconfigurations.

To realize fast reconfiguration, an Optically Differential Reconfigurable Gate Array (ODRGA) has been developed [13], [14]. The ODRGA architecture facilitates both rapid configuration and the use of a gate array during reconfiguration. Moreover, the ODRGA architecture has a much higher reconfiguration frequency than those of either conventional ORGAs or OPGAs.

To date, optical configuration records have been reported as follows. A dynamic ORGA has achieved a 1.59 ms holographic reconfiguration [15]. An OPGA has demonstrated a 16 μs holographic configuration [10]–[12]. In addition, as a method not using holographic memory, a two-context configuration system has demonstrated 220 ns reconfiguration using image formulation [16]. Nevertheless, this latter method had an important shortcoming: its reconfiguration contexts were limited to two. On the other hand, the configuration contexts are not limited in the holographic configuration. Therefore, its multicontext implementation must be a holographic configuration.
This paper presents a four-context holographic configuration system and the experimental results of fast reconfigurations. The advantages of the ODRGA architecture are discussed based on those results.

II. ODRGA ARCHITECTURE

A. Holographic Configuration Property

Since an ORGA receives optical configuration contexts on photodiodes and because the photodiode response frequency is typically proportional to the light intensity, as shown in Fig. 1, high-intensity light reduces the period of optical reconfiguration. Therefore, the easiest way to reduce reconfiguration periods is to use high-powered lasers. However, the uses of such high-power lasers increase the unit’s power consumption and might require the use of a cooling system. Such a cooling system would greatly increase the package size; for that reason, high-power lasers should be used only as a last resort.

In an ORGA, a holographic memory generates optical configuration contexts. A holographic memory has a property by which the light intensity of each bit diffracted from a holographic memory is inversely proportional to the number of bright bits included in a configuration context. Summarizing those points very simply, if the number of bright bits in a certain configuration context can be decreased, the reconfiguration speed of the context can be accelerated with no increase of laser power. The differential reconfiguration architecture is an architecture that reduces the number of such bright bits included in a configuration context.

B. Differential Reconfiguration Architecture

A differential reconfiguration architecture of ODRGAs enables bit-by-bit reconfiguration based on the difference between a previous reconfiguration context and the subsequent reconfiguration context. That difference is programmed in a holographic memory as a reconfiguration context. A programmable gate array in an ODRGA-VLSI can store a context. A reconfiguration is executed using the previous context stored in ODRGA-VLSI and a different context in a holographic memory. The following equation describes operation of an optical differential reconfiguration using bit-length N generalized from the length of configuration bits on an ODRGA. The i-th configuration pattern is represented in the form of N-dimensional vectors as

\[ \alpha_i = (\alpha_1, \alpha_2, \ldots, \alpha_N) \]

where each element of the vectors takes a binary value \( \{0,1\} \). It is assumed ex-ante that the prior configuration pattern \( \alpha_i \) has

Fig. 1. Relationship between photodiode response frequency and its incoming light intensity.
already been stored on a VLSI chip and that a subsequent configuration pattern \( \alpha_{i+1} \) has already been determined. The \( i \)-th difference vector \( \gamma_i \) is defined in the following equation as

\[
\gamma_i = \alpha_{i+1} \oplus \alpha_i
\]  

(2)

where \( \oplus \) represents an exclusive-or operation between binary vectors; also, \( \gamma_i \) expresses the \( i \)-th reconfiguration pattern, which is written to an optical holographic memory. A differential reconfiguration procedure is performed using an exclusive-or operation between the difference vector \( \gamma_i \) and the prior configuration vector \( \alpha_i \) as follows:

\[
\alpha_{i+1} = \gamma_i \oplus \alpha_i
\]  

(3)

This fact indicates that the subsequent configuration vector \( \alpha_{i+1} \) is generated based on the previous configuration vector \( \alpha_i \), which is stored on the VLSI chip, and that an arbitrary location of the gate array on the VLSI chip can be reconfigured where elements of the difference vector \( \gamma_i \) equal 1, corresponding to the bright state of light. Therefore, that differential reconfiguration strategy enables bit-by-bit reconfiguration of an arbitrary area. Since dynamic reconfiguration is always executed partially and not entirely, the bit-by-bit reconfiguration capability can reduce the number of bright bits included in a reconfiguration context drastically, so that the reconfiguration frequency can be accelerated.

C. Inversion Technique

To realize a higher-speed reconfiguration in addition to the effect of the differential reconfiguration architecture of a bit-by-bit reconfiguration, an inversion technique is introduced. In this strategy, each configuration datum for logic blocks, switching matrices, and I/O blocks are divided into small segments. For small segments, the inversion technique is applied. The following discussion uses bit-length \( N_s \), which represents the number of bits included in a small segment. Similarly to the definition of the differential reconfiguration architecture, the \( i \)-th configuration vector \( \alpha_i \) and the \( i \)-th difference vector \( \gamma_i \) are indicated in the forms of \( N_s \)-dimensional vector as

\[
\alpha_i = (\alpha_1, \alpha_2, \ldots, \alpha_{N_s})
\]  

(4)

\[
\gamma_i = (\gamma_1, \gamma_2, \ldots, \gamma_{N_s})
\]  

(5)

where each element of the vectors takes a binary value \( \{0, 1\} \). In addition, the \( i \)-th difference vector \( \gamma_i \) is defined in the following equation as

\[
\gamma_i = \alpha_{i+1} \oplus \alpha_i \oplus C_i
\]

\[
I_{i+1} = C_i \oplus I_i
\]  

(6)

where \( I_i, I_{i+1} \), and \( C_i \) mean the current \( i \)-th inverse variable, the successive \( i+1 \)-th inverse variable, and the condition variable, which also take a binary value \( \{0, 1\} \), the role of which is to decide whether the exclusive OR operator takes \( \alpha_{i+1} \) and the current configuration vector \( \alpha_i \) or \( \alpha_{i+1} \) and its inversion vector \( \overline{\alpha}_i \). Condition variable \( C_i \) is given as shown below

\[
C_i = \begin{cases} 
1 : & \sum_{j=1}^{N_s} (\alpha_{i+1}(j) \oplus \alpha_i(j)) \geq \left[ \frac{N_s}{2} + 1 \right] \\
0 : & \text{otherwise.}
\end{cases}
\]  

(7)

In that equation, \( \lceil N_s/2 \rceil + 1 \) denotes \( N_s/2 + 1 \) is rounded down to the nearest whole number. A subsequent configuration vector \( \alpha_{i+1} \) in the same segment is generated by calculating

Fig. 2. Light intensity of each bit of optical configuration context diffracted from a holographic memory. Panels (a) and (d) show the light intensity of a configuration context including a single bright bit. Panels (b) and (e) show the light intensity of a configuration context including 12 bright bits. Panels (c) and (f) show the light intensity of a configuration context including 23 bright bits.
exclusive OR operations between the difference vector $\gamma_i$ and the current configuration vector $\alpha_i$, as follows:

$$\alpha_{i+1} = \gamma_i \oplus \alpha_i.$$  \hfill (8)

Therefore, the next configuration vector $\alpha_{i+1}$ is generated using the current configuration vector $\alpha_i$ when the next configuration vector $\alpha_{i+1}$ is more similar to the current configuration vector $\alpha_i$ than its inversion vector $\tilde{\alpha}_i$. In contrast, the vector $\alpha_{i+1}$ is generated using vector $\alpha_i$ when an adverse condition obtains. Here, the difference vector $\gamma_i$ and the condition variable $C_i$ are programmed for a holographic memory.

The reduction efficiency of the number of bright bits of the inversion configuration method is discussed. Here, it is assumed that configuration contexts are given continuously for an ORGA-VLSI and that they uniformly include all possible patterns. For example, regarding 4-bit configuration, all possible patterns means 16 patterns of “0000”, “0001”…, and “1111”.

Under such a condition, first, the reduction efficiency of the number of bright bits in a configuration context of conventional ORGAs is estimated. The average number of 1s corresponding to laser irradiation is calculated by counting bit 1 of all possible vectors and dividing that quantity by $N_s 2^N$ of the summation of bits of all possible vectors, as in the following equation:

$$K_{\text{ORGA}} = \frac{\sum_{r=1}^{N_s} r \cdot N_s C_r}{2^N N_s} = \frac{1}{2}.$$  \hfill (9)

where $N_s C_r$ is a combination. Similarly, the reduction of the number of bright bits included in a configuration context of the inversion configuration method can be estimated as presented in Table I. The average number of ‘1’ writings corresponding to laser irradiation is calculated by counting the ‘1’ bits of all possible vectors and dividing that number by $N_s 2^N$ of the summation of bits of all possible vectors, as in the following equation:

$$K_{\text{NEW}} = \frac{\sum_{r=1}^{[N_s/2]} r \cdot N_s C_r}{2^N N_s} + \frac{\sum_{r=[N_s/2+1]}^{N_s} (N_s - r + 1) \cdot N_s C_r}{2^N N_s}.$$  \hfill (10)

The first term in the right side of the upper (10) is identical to (9). In this case, an inversion bit is equal to 0. In addition, the second term in the right side of the upper (10) represents the case in which the inversion bit is equal to 1. Table I portrays a calculation example of the number of bright bits in a configuration context including four configuration bits. In this case, the average number of bright bits can be decreased from 2.0 of conventional ORGAs to 1.526. Using this inversion method, in the case of four bits, about 22% of bright bits are removable. Consequently, the reconfiguration frequency can be increased.
D. Differential Reconfiguration Circuit

Fig. 4 portrays a circuit diagram of a single-segment differential reconfiguration circuit. The configuration circuit consists of refresh transistors, photodiodes, toggle-flip-flops (T-FFs), and exclusive-OR gates (EXORs). The configuration procedure is begun by activating refresh transistors to charge the junction capacitances of photodiodes. After charging is completed, laser light is irradiated onto the photodiodes. The states of photodiodes that have received light become ‘0’ and the states of others become ‘1’. Then, the states are latched by toggle-flip-flops. The toggle-flip-flops connected to photodiodes that have received light flip and the other toggle-flip-flops retain their previous states. The execution means a differential reconfiguration strategy. In addition, a toggle flip-flop with role of the inverse variable is the left-most side of toggle-flip-flops. The output of the toggle flip-flop determines whether the outputs of the other toggle-flip-flops are reversed or not.

III. FOUR-CONTEXT ODRGA SYSTEM I

A. Holographic Memory Calculation

Here, the calculation method of a holographic memory is described. A hologram for ODRGAs is assumed as a thin holographic medium. A laser aperture plane, a holographic plane, and an ODRGA-VLSI plane are parallelized. The laser beam is expanded and is assumed for which the aperture size is fully wide for the holographic medium. Therefore, the laser beam can be considered as a plane wave. The reference wave from the laser propagates into the holographic plane. The holographic medium comprises rectangular pixels on the \( x_1 - y_1 \) holographic plane. The pixels are assumed as analog values. On the other hand, the input object is made up of rectangular pixels on the \( x_2 - y_2 \) object plane. The pixels can be modulated to be either on or off. The intensity distribution of a holographic medium is calculable using the following equation:

\[
H(x_1, y_1) \propto \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} O(x_2, y_2) \sin(kr) dx_2 dy_2,
\]

\[
I = \sqrt{Z_L^2 + (x_1 - x_2)^2 + (y_1 - y_2)^2},
\]

In that equation, \( O(x_2, y_2) \) is a binary value of a reconfiguration context, \( k \) is the wave number, and \( Z_L \) represents the distance between the holographic plane and the object plane. The value \( H(x_1, y_1) \) is normalized as 0–1 for the minimum intensity \( H_{\text{min}} \) and maximum intensity \( H_{\text{max}} \) as the following:

\[
H'(x_1, y_1) = \frac{H(x_1, y_1) - H_{\text{min}}}{H_{\text{max}} - H_{\text{min}}}. \quad (12)
\]

Finally, the normalized image \( H' \) is used for implementing the holographic memory. The other areas on the holographic plane are opaque to the illumination.

B. Experimental System

For confirmation of the ODRGA architecture’s fast-configuration advantages, first, an ODRGA holographic memory system not using the differential reconfiguration strategy and the ODRGA architecture’s inversion technique were demonstrated. The ODRGA holographic memory system has four configuration contexts. It includes liquid crystal spatial light modulator (LC-SLM) as a holographic memory and a 532 nm – 300 mW laser as laser light sources, as portrayed in Fig. 5. The photograph is shown in Fig. 6. In this experiment, one laser was shared for four configurations because of experimental resources. The laser beam was collimated; the beam was incident to a holographic memory on the LC-SLM. The holographic memory was calculated using (11) and (12), as shown in Fig. 7(e). The LC-SLM’s recording area is divided to four areas, each consisting of 300 \( \times \) 300 pixels and each including configuration information of a XNOR circuit, an AND circuit, a XOR circuit, or a NOR circuit. The LC-SLM is a projection TV panel (L3P07X-31G0; Seiko Epson Corp.). It is a 90° twisted nematic device with a thin film transistor. The panel consists of 1,024 \( \times \) 768 pixels, each of which is 14 \( \times \) 14 \( \mu \)m\(^2\). The LC-SLM is connected to an evaluation board (L3B07X-E10A; Seiko Epson Corp.). The video input of the board is connected to the external display terminal of a personal computer. Programming for the LC-SLM is executed by displaying a holographic memory pattern of green with 256 gradation levels on the personal computer display. A
holographic pattern in Fig. 7(e) is displayed on the LC-SLM. The ODRGA-VLSI was placed at a distance of 100 mm from the LC-SLM.

In this experiment, a 0.35 \( \mu \)m triple-metal CMOS process-fabricated ORGA-VLSI chip like that shown in Fig. 6(c) was used. The photodiodes were constructed between the N-Well layer and the P-substrate. The photodiode size and distance between photodiodes were designed as 25.5 \( \times \) 25.5 \( \mu \)m\(^2\) and as 90 \( \mu \)m to facilitate the optical alignment. The gate array structure is fundamentally identical to that of typical FPGAs. The ORGA-VLSI chip includes 4 logic blocks, 5 switching matrices, and 12 I/O bits. The number of photodiodes used to program the gate array is 340.

C. Experimental Results

Using the previously described experimental system setup, the configurations were executed. Four configuration contexts of a XNOR circuit, an AND circuit, a XOR circuit, and a NOR circuit were programmed onto four areas of the LC-SLM. Their

CCD-captured images at the position of ODRGA-VLSI are depicted in Fig. 8. The reconfiguration periods of the XNOR circuit, the AND circuit, the XOR circuit, and the NOR circuit were measured respectively as 812 ns, 1,000 ns, 854 ns, and 917 ns. The results are much faster than those obtained using
currently available FPGAs. However, these experimental results were obtained using a normal configuration method; the experiments were not executed with the differential reconfiguration strategy and the inversion technique of the ODRGA architecture. Section IV presents a description of the improvements obtained when using the differential reconfiguration strategy and the inversion technique.

IV. FOUR CONTEXT EXPERIMENTS EXPLOITING DIFFERENTIAL RECONFIGURATION AND INVERSION TECHNIQUE

A. Experimental Results

Successive configurations were executed using the experimental system setup described previously. Here, only the collimated beam size differs from that used in the previous experiment. A configuration context of an AND circuit and a reconfiguration context from the AND circuit to an OR circuit exploiting the differential reconfiguration architecture were programmed onto the LC-SLM. Fig. 9(a) and (b), respectively, portrays the configuration context pattern of the AND circuit and the reconfiguration context pattern from the AND circuit to the OR circuit. Fig. 9(c) portrays a calculated multicontext holographic memory pattern storage of the AND circuit and the OR circuit. First, configuration of the AND circuit was executed, as portrayed in Fig. 9(f). A CCD-captured image of the configuration context of the AND circuit at the position of the ODRGA-VLSI is depicted in Fig. 9(d). The optical configuration context was programmed onto the ODRGA-VLSI. Results confirmed that the AND circuit works correctly. Then, reconfiguration from the AND circuit to an OR circuit was executed as presented in Fig. 9(f) using the differential reconfiguration strategy. A CCD-captured image of the reconfiguration context from the AND circuit to the OR circuit at the position of the ODRGA-VLSI is depicted in Fig. 9(e). After the reconfiguration, it was confirmed that the OR circuit works correctly. Finally, the number of bright bits for such a reconfiguration was reduced to two bits. The reconfiguration time was confirmed as 83.3 ns by exploiting the ODRGA architecture.

Moreover, a configuration context of an XOR circuit and a reconfiguration context from the XOR circuit to a NOR circuit...
exploiting the differential reconfiguration architecture were programmed onto the LC-SLM. Fig. 10(a) and (b), respectively, show the configuration context pattern of the XOR circuit and the reconfiguration context pattern from the XOR circuit to the NOR circuit. Fig. 10(c) portrays a calculated multicontext holographic memory pattern storing the XOR circuit and the NOR circuit. First, configuration of the XOR circuit was executed, as portrayed in Fig. 10(f). A CCD-captured image of the configuration context of the XOR circuit at the position of the ODRGA-VLSI is depicted in Fig. 10(d). The optical configuration context was programmed onto the ODRGA-VLSI. Results therefore confirmed that the XOR circuit works correctly. Then, reconfiguration from the XOR circuit to a NOR circuit was executed using the differential reconfiguration strategy, as presented in Fig. 10(f). A CCD-captured image of the reconfiguration context from the XOR circuit to the NOR circuit at the position of the ODRGA-VLSI is depicted in Fig. 10(e). After reconfiguration, it was confirmed that the NOR circuit functions correctly. Finally, the number of bright bits for such a reconfiguration was reduced to two bits. The reconfiguration time was confirmed as 83.3 ns by exploiting the ODRGA architecture.

Next, the inversion technique advantage was demonstrated. A configuration context of an AND circuit and a reconfiguration context from the AND circuit to a NAND circuit exploiting the differential reconfiguration architecture and the inversion technique were programmed onto the LC-SLM. Fig. 11(a) and (b), respectively, depicts the configuration context pattern of the AND circuit and the reconfiguration context pattern from the AND circuit to the NAND circuit. Fig. 11(c) portrays a calculated multicontext holographic memory pattern storing the AND circuit and the NAND circuit. First, a configuration of the AND circuit was executed, as portrayed in Fig. 11(f). A CCD-captured image of the configuration context of the AND circuit at the position of the ODRGA-VLSI is depicted in Fig. 11(d). The optical configuration context was programmed onto the ODRGA-VLSI. Results confirmed that the AND circuit works correctly. Then, a reconfiguration from the AND circuit to a NAND circuit was executed as presented in Fig. 11(f) using the differential reconfiguration strategy and the inversion technique. A CCD-captured image of the reconfiguration context from the AND circuit to the NAND circuit at the position of the ODRGA-VLSI is depicted in Fig. 11(e). After reconfiguration, it was confirmed that the NAND circuit works correctly. Finally, the number of bright bits for such reconfiguration was reduced to a bit. The reconfiguration time was confirmed as 41.7 ns by exploiting the ODRGA architecture.

Moreover, a configuration context of an XOR circuit and a reconfiguration context from the XOR circuit to an XNOR circuit exploiting the differential reconfiguration architecture and the inversion technique were programmed onto the LC-SLM. Fig. 12(a) and (b), respectively, presents the configuration context pattern of the XOR circuit and the reconfiguration context pattern from the XOR circuit to the XNOR circuit. Fig. 12(c) portrays a calculated multicontext holographic memory pattern storing the XOR circuit and the XNOR circuit. First, a configuration of the XOR circuit was executed, as portrayed in Fig. 12(f). A CCD-captured image of the configuration context of the XOR circuit at the position of the ODRGA-VLSI is depicted in Fig. 12(d). The optical configuration context was
Fig. 11. Reconfiguration experiment form an AND circuit to a NAND circuit. Panels (a) and (b) show the configuration context pattern of the AND circuit and the reconfiguration context pattern from the AND circuit to the NAND circuit, respectively. Panel (c) portrays a calculated multicontext holographic memory pattern storing the AND circuit and the NAND circuit. Panel (d) and (e) show CCD-captured images of the configuration context pattern of the AND circuit and the reconfiguration context pattern from the AND circuit to the NAND circuit, respectively. Panel (f) shows the waveform of the 41.7 ns reconfiguration form the AND circuit to the NAND circuit.

programmed onto the ODRGA-VLSI. Consequently, the XOR circuit was confirmed to have worked correctly. Then, recon-

Fig. 12. Reconfiguration experiment form an XOR circuit to an XNOR circuit. Panels (a) and (b) show the configuration context pattern of the XOR circuit and the reconfiguration context pattern from the XOR circuit to the XNOR circuit, respectively. Panel (c) portrays a calculated multicontext holographic memory pattern storing the XOR circuit and the XNOR circuit. Panel (d) and (e) show CCD-captured images of the configuration context pattern of the XOR circuit and the reconfiguration context pattern from the XOR circuit to the XNOR circuit, respectively. Panel (f) shows the waveform of the 41.7 ns reconfiguration form the XOR circuit to the XNOR circuit.

reconfiguration of an XOR circuit to an XNOR circuit was executed as presented in Fig. 12(f) using the differential reconfig-
uration strategy and the inversion technique. A CCD-captured image of the reconfiguration context from the AND circuit to the XNOR circuit at the position of the ODRGA-VLSI is depicted in Fig. 12(e). After the reconfiguration, the XNOR circuit was verified as functioning correctly. Finally, the number of bright bits for such reconfiguration was reduced to a bit. The reconfiguration time was confirmed as 41.7 ns by exploiting the ODRGA architecture.

Finally, the number of bright bits for such reconfigurations was reduced to a bit. The reconfiguration time was confirmed as 41.7 ns by exploiting the ODRGA architecture. It must be emphasized that the 41.7 ns optical reconfiguration is the world’s fastest result. Moreover, the result shows that the circuit functions during the reconfiguration process. For this reason, in the four-context ODRGA, fast configuration was confirmed with no idling of the gate array. The ODRGA architecture is a useful technique to achieve rapid reconfiguration.

V. DISCUSSION FOR PRACTICAL APPLICATIONS

This section presents discussion of some issues related to ODRGAs’ future practical applications and necessary actions.

A. Gate Count

In ODRGAs, although a large virtual gate count can be realized using the large storage capacity of a holographic memory, an actual gate count, which is the gate count of a programmable gate array VLSI, is important to increase the instantaneous performance. Although the real gate count of the first prototype 68-gate ODRGA-VLSI chip was too small for practical applications, the future prospect of ODRGA-VLSIs were already reported to be as large as FPGAs [18]. In the case of 45 nm process, the gate count was estimated as 7 k gates/mm². Therefore, 2.8 million gates can be implemented on a 20 mm × 20 mm die chip. Precise comparison reveals that the ODRGA gates are slightly fewer than those of FPGAs. However, given 1,000,000 reconfiguration contexts in a holographic memory and a 1 million gate count ODRGA-VLSI, then a 1 Tera gate count VLSI is possible. Therefore, the ODRGA presents the possibility of realizing huge virtual gate-count VLSIs. It therefore presents a gate count advantage compared to FPGAs.

B. Alignment Precision

The ODRGA consists of a holographic memory, a laser diode array, and a gate array VLSI. Of course, although the ODRGAs are extremely advanced devices, the ODRGA requires high-precision alignment between a holographic memory and an ODRGA-VLSI and between a laser array and the holographic memory. To date, such an alignment has been analyzed theoretically and experimentally [17]. Results show that 1 μm precision alignment is sufficient to assemble ODRGAs. Therefore, ODRGA packages can be assembled using currently available manufacturing technologies.

C. Power Consumption

The power consumption issue is a salient point of difficulty related to dynamic reconfiguration devices. For ODRGAs, optical power consumption might become the main component of power consumption. For example, as an ODRGA-VLSI gate count increases, necessary optical power consumption is also increased, as shown in Fig. 3. Here, programming for a 1 million fine grain gate array is assumed. In this case, about 3,000,000 photodiodes are necessary to program it. Each photodiode of the current prototype ODRGA-VLSI can respond to $2.1 \times 10^6$ photons ($\lambda = 532$ nm). On the other hand, the diffraction efficiency of the liquid crystal holographic memory was about 0.05%. In this case, a 10 W laser is required for a 231 μs configuration. However, such high power consumption can be lessened by increasing the efficiency of the holographic memory, and the photodiodes. For example, the diffraction efficiency of a current amplitude-modulation-type holographic memory is not good. However, the use of phase-modulation-type holographic memory will increase its efficiency dramatically. Moreover, the photodiode sensitivity of current prototype ODRGA-VLSI chip is not good because the ODRGA-VLSI chip was not fabricated using the photodiode process. According to results presented in one paper [10], if the ODRGA-VLSI chip can use the photodiode process, the number of photons to which a photodiode can respond can be decreased to 800. In addition, the power consumption will be improved because a 100 mW laser will achieve a less than 1 μs configuration for 1 million gates if the system uses a phase-modulation-type holographic memory.

D. Reconfiguration Advantage for FPGAs

In an earlier discussion of power consumption, the reconfiguration laser power for a future ODRGA with 1 million gates was estimated. In fact, less than 1 μs reconfiguration can be achieved using reasonable laser power. Here, an FPGA with the same 1 million gates is assumed. The FPGA is assumed to be reconfigured at 100 MHz. At that time, 10 ms is required to program it. Therefore, comparing ODRGAs with FPGAs, ODRGA presents the important advantage of extremely rapid reconfiguration.

E. Dynamic Reconfiguration Advantage

Recently, almost all computer systems have come to use a reduced instruction set computer (RISC) architecture. That architecture presents advantages in terms of higher clock frequency, smaller implementation area, and lower power consumption than conventional complex instruction set computer (CISC) architecture. The success is based on the fact that, in the circuit implementation, a simple circuit is the best means to realize high-clock frequency or high performance. That principle can be adapted also to programmable devices. Currently, a general purpose unit is always implemented onto a programmable device. Reconfiguration is never used. However, if clock-by-clock reconfigurable devices can be used, a single function or a single instruction set computer can be implemented onto it. Of course, the function will be an unused circuit after a clock cycle. However, at that time, the unused circuit need only be reconfigured to another necessary circuit if we can use clock-by-clock reconfigurable devices. Such a single circuit or a single-instruction-set computer can function at an extremely high clock frequency and provide extremely high performance. The performance of a gate array can therefore be improved compared with current general purpose unit...
implementation or multifunction implementation. Moreover, such a single circuit or a single-instruction-set computer can be implemented in a small implementation area or on a small die. Consequently, using the same implementation area, large parallel computation is possible. For the reasons described above, the overall performance can be increased dramatically. This is a future use of programmable devices. For that reason, a fast reconfigurable device with many reconfiguration contexts is necessary for such realization. The FPGAs’ reconfiguration capability is insufficient even if FPGAs can be partially reconfigured. Therefore, ODRGAs can open a new computation paradigm.

VI. CONCLUSION

To realize faster reconfigurations than those afforded by conventional ORGAs, an Optically Differential Reconfigurable Gate Array (ODRGA) has been developed. Nevertheless, rapid configuration experiments performed under multiple contexts exploiting the ODRGA architecture have never been reported. This paper has introduced and described a demonstration of a four-context ODRGA system. First, normal configuration periods not using ODRGA architecture were measured as 812 ns, 1,000 ns, 854 ns, and 917 ns, which are faster than that of FPGAs. Moreover, the second reconfiguration time was confirmed as 41.7 ns by exploiting the ODRGA architecture. In fact, the 41.7 ns optical reconfiguration is the world’s fastest result. In addition, the result shows that circuits work during the reconfiguration processes. Using this architecture, the period of time necessary for the reconfiguration procedure is negligible. Finally, in the four-context ODRGA, fast configuration with no idleness of a gate array has been confirmed, which demonstrates that the ODRGA architecture is an extremely useful technique for rapid reconfiguration.

ACKNOWLEDGMENT

The VLSI chip in this study was fabricated in the chip fabrication program of VLSI Design and Education Center (VDEC), the University of Tokyo in collaboration with Rohm Co., Ltd., and Toppan Printing Co., Ltd.

REFERENCES


Mao Nakajima (S’07) was born in Shizuoka, Japan, in March, 1986. He is currently a graduate student in the Department of Electrical and Electronic Engineering at Shizuoka University, Hamamatsu, Japan. His current research interests are optically reconfigurable gate arrays.

Minoru Watanabe (M’00) received B.S. and M.S. degrees in opto-electronic and mechanical engineering from Shizuoka University, Hamamatsu, Japan, in 1992 and 1994, respectively, and the Ph.D. degree in computer science and systems engineering from the Kyushu Institute of Technology, Fukuoka, Japan, in 2005. He joined Nissan Motor Co., Ltd., Zama, Japan, in 1994, where he was a development engineer of embedded systems until December 1999. He was a Research Associate in the Department of Control Engineering and Science during January 2000–2003 and in the Department of Systems Innovation and Informatics during 2004–2006, Kyushu Institute of Technology, Fukuoka, Japan. He is currently an Associate Professor in the Department of Electrical and Electronic Engineering, Shizuoka University. His current research interests are programmable devices and optically reconfigurable gate arrays.

Dr. Watanabe is a member of IEEE, the Institute of Electronics, Information, and Communication Engineers of Engineers of Japan, the Japan Society of Applied Physics, and the Society of Instrument and Control Engineers.