

## Key Design Features

- Synthesizable, technology independent VHDL IP Core
- Decoding of a bayer-mapped image from an image sensor or Colour Filter Array (CFA)
- Supports all image resolutions from 16 x 16 pixels up to 2<sup>16</sup> x 2<sup>16</sup>
- Support for all sensor pixel widths from 2-bits and above
- Support for different CFA sensor alignments
- Fully pipelined architecture with simple flow control
- Input and output ports can interface directly to a FIFO if required
- Features a 5x5 polyphase interpolation filter
- Output 1 x 24-bit pixel per clock
- No frame buffer required
- Small implementation size
- Support for 300 MHz+ operation on basic FPGA devices¹

## **Applications**

- Bayer-mapped to 24-bit RGB decoding (de-mosaicing)
- Digital camera image processing
- Forms an essential first stage in most digital processing pipelines that contain an image sensor

# Generic Parameters

| Generic name    | Description                    | Туре    | Valid range                               |
|-----------------|--------------------------------|---------|-------------------------------------------|
| dw              | Sensor width in bits           | integer | ≥ 2                                       |
| line_width      | Width of linestores in pixels  | integer | 2 <sup>4</sup> < pixels < 2 <sup>16</sup> |
| log2_line_width | Log2 of linestore width        | integer | log2(line_width)                          |
| sensor_align    | Bayer pattern sensor alignment | integer | 0 : BGBG<br>GRGR                          |
|                 |                                |         | 1 : GBGB<br>RGRG                          |
|                 |                                |         | 2 : GRGR<br>BGBG                          |
|                 |                                |         | 3 : RGRG<br>GBGB                          |

# **Block Diagram**



Figure 1: Bayer-to-RGB converter architecture

## **Pin-out Description**

| Pin name         | 1/0 | Description                                                                  | Active state |
|------------------|-----|------------------------------------------------------------------------------|--------------|
| clk              | in  | Synchronous clock                                                            | rising edge  |
| reset            | in  | Asynchronous reset                                                           | low          |
| pixels_per_line  | in  | Number of pixels per line<br>Range: 2 <sup>4</sup> < lines < 2 <sup>16</sup> | data         |
| lines_per_frame  | in  | Number of lines per frame<br>Range: 2 <sup>4</sup> < lines < 2 <sup>16</sup> | data         |
| pixin [dw - 1:0] | in  | n-bit bayer-mapped pixel in                                                  | data         |
| pixin_vsync      | in  | Vertical sync in (Coincident with first pixel of input frame)                | high         |
| pixin_hsync      | in  | Horizontal sync in (Coincident with first pixel of input line)               | high         |
| pixin_val        | in  | Input pixel valid                                                            | high         |
| pixin_rdy        | out | Ready to accept input pixel (Handshake signal)                               | high         |
| pixout [23:0]    | out | 24-bit RGB pixel out                                                         | data         |
| pixout_vsync     | out | Vertical sync out<br>(Coincident with first pixel<br>of output frame)        | high         |
| pixout_hsync     | out | Horizontal sync out (Coincident with first pixel of output line)             | high         |
| pixout_val       | out | Output pixel valid                                                           | high         |
| pixout_rdy       | in  | Ready to accept output pixel (Handshake signal)                              | high         |

<sup>1</sup> Xilinx Virtex6 used as a benchmark



### **General Description**

BAYER\_TO\_RGB (Figure 1) is a fully pipelined Bayer-mapped to RGB converter IP core. The IP core may be used to process the raw pixels from an image sensor or Colour Filter Array (CFA). These pixels are typically organized as a bayer pattern of discrete Red, Green and Blue values which must be interpolated to recover the original image - a process that is commonly known as de-mosaicing.

Internally, the circuit uses a 5x5 pixel filter with dynamic coefficients to interpolate the pixels from the CFA. The resulting output is a high-quality RGB image at 24-bits/pixel.

Bayer-mapped pixels flow into the design in accordance with the valid ready pipeline protocol<sup>2</sup>. Input pixels and syncs are sampled on the rising edge of *clk* when *pixin\_val* and *pixin\_rdy* are both high. Likewise, at the output interface, pixels and syncs are sampled on a the rising edge of *clk* when *pixout\_val* and *pixout\_rdy* are both high.

The image size is fully programmable with standard support for anything from 16x16 pixels and above. The width of the input pixels is specified using the generic parameter *dw*. Output pixels are standard 24-bit RGB.

#### Sensor alignment

The sensor alignment parameter modifies the central starting position of the 5x5 filter according to the alignment of the bayer pattern. Figure 2 below demonstrates the 4 possible sensor alignments in the CFA.



Figure 2: CFA sensor alignments

By setting the *sensor\_align* parameter correctly, the design can adapt to the four possible patterns. If the alignment is wrong, then the colours in the output image will be corrupted.

The size of the image to be interpolated is fully programmable and is specified in the parameters: <code>pixels\_per\_line</code> and <code>lines\_per\_frame</code>. These parameters can be changed on a frame-by-frame basis if necessary. It is recommended that a system reset is asserted once the parameters have been changed to avoid possible image corruption. After reset, the IP core will start generating output pixels after the next clean input frame.

The generic parameters *line\_width* and *log2\_line\_width* must also be set correctly to accommodate the maximum line length of the input image. The line width must be specified as the nearest power of 2 - e.g. 1024, 2048, 4096 etc.

#### De-mosaicing filter

The internal filter is a 5x5 pixel filter that is used to interpolate the bayer-mapped image. The filter architecture uses a polyphase filter design in which the filter kernel changes depending on the interpolation point in the bayer pattern. The filter kernels are based on the Malvar-He-Cutler algorithm which has been shown to give excellent results for a very reasonable resource cost. For optimum results it is recommended that the image sensor has a capacity of at least 5M pixels or better.

## **Functional Timing**

Figure 3 shows the signalling at the input interface at the start of a new frame. The first line of a new frame begins with <code>pixin\_vsync</code> and <code>pixin\_hsync</code> asserted high together with the first pixel. Note that the signals <code>pixin</code>, <code>pixin\_vsync</code> and <code>pixin\_hsync</code> are only valid if <code>pixin\_val</code> is also asserted high. For demonstration purposes, the diagram also shows what happens when <code>pixin\_rdy</code> is de-asserted. In this case, the pipeline is stalled and the upstream interface must hold-off before further pixels are processed.



Figure 3: Start of new input frame

Figure 4 shows the signalling at the start of a new line. Note that the timing diagram is the same as for the start of a new frame with the exception that <code>pixin\_vsync</code> is held low while <code>pixin\_hsync</code> is held high together with the first valid pixel. In this example, <code>pixin\_rdy</code> is held high for the duration so the input interface does not stall.

Image resolution

<sup>2</sup> See Zipcores application note: app\_note\_zc001.pdf for more examples of how to use the valid-ready pipeline protocol





Figure 4: Start of new input line

The final timing waveform (Figure 5) shows the output signalling for the start of a new output frame. The output flow-control is identical to the input with the use of the same valid-ready protocol. In this example, the <code>pixout\_rdy</code> signal is held high during the output of the frame. (Note that if the downstream interface can always accept pixels then <code>pixout\_rdy</code> may be tied to logic '1').



Figure 5: Start of new output frame

## Source File Description

The source files are provided as text files coded in VHDL. The following table gives a brief description of each file.

| Source file            | Description                      |
|------------------------|----------------------------------|
| bayer_in.txt           | Text-based source image file     |
| pipeline_reg.vhd       | Pipeline register component      |
| fifo_sync.vhd          | Synchronous FIFO                 |
| bayer_file_reader.vhd  | Reads image file into test bench |
| ram_dp_w_r.vhd         | Dual-port RAM component          |
| bayer_buffer.vhd       | Line buffer component            |
| bayer_filter.vhd       | 5x5 pixel filter component       |
| bayer_to_rgb.vhd       | Top-level component              |
| bayer_to_rgb_bench.vhd | Top-level test bench             |

#### Functional Testing

An example VHDL testbench is provided for use in a suitable VHDL simulator. The compilation order of the source code is as follows:

- pipeline\_reg.vhd
- 2. fifo sync.vhd
- 3. ram\_dp\_w\_r.vhd
- 4. bayer\_buffer.vhd
- 5. bayer filter.vhd
- 6. bayer\_to\_rgb.vhd
- 7. bayer\_file\_reader.vhd
- bayer\_to\_rgb\_bench.vhd

The VHDL testbench instantiates the BAYER\_TO\_RGB component and the user may modify the generic parameters as required. In particular the user must ensure that the image dimensions and sensor alignment are correct for the input source image.

The input source image for the simulation is generated by the file reader component. The component reads a text file that contains the discrete R,G,B bayer-mapped pixels. The text file is called <code>bayer\_in.txt</code> and should be placed in the top-level simulation folder.

The file <code>bayer\_in.txt</code> follows a simple format which defines the state of the signals: <code>pixin\_val</code>, <code>pixin\_vsync</code>, <code>pixin\_hsync</code> and <code>pixin</code> on a clock-by-clock basis. An example file might be the following:

| 1 1 1 0052 | # pixel 0 line 0 (start of frame) |
|------------|-----------------------------------|
| 1 0 0 0073 | # pixel 1                         |
| 1 0 0 0052 | # pixel 2                         |
| 1 0 0 0072 | # pixel 3                         |
|            |                                   |
| •          |                                   |
| 1 0 1 0072 | # pixel 0 line 1 (start of line)  |
| 1 0 0 0050 | # pixel 1                         |
| 1 0 0 0071 | # pixel 2                         |
| 1 0 0 004d | # pixel 3 etc                     |

In this example the first line of the  $bayer\_in.txt$  file asserts the input signals  $pixin\_val = 1$ ,  $pixin\_vsync = 1$ ,  $pixin\_hsync = 1$  and pixin = 0x0052.

In the example simulation shipped with the source code, a sample VGA image is processed by the Bayer to RGB converter. The simulation must be run for at least 10 ms during which time an output text file called <code>bayer\_out.txt</code> is generated<sup>3</sup>. This file contains a sequential list of 24-bit RGB output pixels in a similar format to <code>bayer\_in.txt</code>.

Figure 6 and 7 show the images before and after de-mosaicing. High quality images can be provided on request.

<sup>3</sup> PERL scripts for generating and processing input and output text files are provided with the IP Core package





Figure 6: Bayer-mapped source image



Figure 7: Output image after de-mosaicing

# Synthesis and Implementation

The files required for synthesis and the design hierarchy is shown below:

- bayer\_to\_rgb.vhd
  - bayer\_buffer.vhd
    - ram\_dp\_w\_r.vhd
  - bayer\_filter.vhd
  - O fifo\_sync.vhd
    - pipeline\_reg.vhd

The VHDL core is designed to be technology independent. However, as a benchmark, synthesis results have been provided for the Xilinx® Virtex 6 and Spartan 6 FPGA devices. Synthesis results for other FPGAs and technologies can be provided on request.

There are no special constraints required for synthesis. The IP core is completely technology independent.

Trial synthesis results are shown with the generic parameters set to: dw = 8,  $line\_width = 2048$ ,  $log2\_line\_width = 11$ ,  $sensor\_align = 0$ .

Resource usage is specified after Place and Route.

#### VIRTEX 6

| Resource type            | Quantity used |
|--------------------------|---------------|
| Slice register           | 903           |
| Slice LUT                | 1277          |
| Block RAM                | 8             |
| DSP48                    | 0             |
| Occupied slices          | 441           |
| Clock frequency (approx) | 300 MHz       |

#### SPARTAN 6

| Resource type            | Quantity used |
|--------------------------|---------------|
| Slice register           | 903           |
| Slice LUT                | 1266          |
| Block RAM                | 9             |
| DSP48                    | 0             |
| Occupied slices          | 436           |
| Clock frequency (approx) | 190 MHz       |

## **Revision History**

| Revision | Change description                                                                                                                   | Date       |
|----------|--------------------------------------------------------------------------------------------------------------------------------------|------------|
| 1.0      | Initial revision                                                                                                                     | 04/10/2012 |
| 1.1      | Moved to 5x5 filter architecture based on Malvar-He-Cutler algorithm                                                                 | 01/03/2013 |
| 1.2      | Updated synthesis results in line with minor code changes. Corrected sensor alignment generic description                            | 15/11/2013 |
| 1.3      | Added full valid-ready flow control to the output interface                                                                          | 27/02/2014 |
| 1.4      | Added sensor width generic. Increased maximum image resolution (16-bit addressing). Various filter optimizations for improved speed. | 18/02/2015 |
|          |                                                                                                                                      |            |

# **X-ON Electronics**

Largest Supplier of Electrical and Electronic Components

Click to view similar products for Development Software category:

Click to view products by Zipcores manufacturer:

Other Similar products are found below:

SRP004001-01 SW163052 SYSWINEV21 WS01NCTF1E W128E13 SW89CN0-ZCC IP-UART-16550 MPROG-PRO535E AFLCF-08-LX-CE060-R21 WS02-CFSC1-EV3-UP SYSMAC-STUDIO-EIPCPLR 1120270005 SW006021-2H ATATMELSTUDIO 2400573 2702579 2988609 SW006022-DGL 2400303 88970111 DG-ACC-NET-CD 55195101-101 55195101-102 SW1A-W1C MDK-ARM SW006021-2NH B10443 SW006021-1H SW006021-2 SW006023-2 SW007023 MIKROE-730 MIKROE-2401 MIKROE-499 MIKROE-722 MIKROE-724 MIKROE-726 MIKROE-728 MIKROE-732 MIKROE-734 MIKROE-736 MIKROE-738 MIKROE-744 MIKROE-928 MIKROE-936 1120270002 1120270003 1120275015 NT-ZJCAT1-EV4