Why Does Histogram Packing Improve Lossless ... - CiteSeerX

IEEE SIGNAL PROCESSING LETTERS, VOL. 9, NO. 8, AUG. 2002

1

Why Does Histogram Packing Improve Lossless Compression Rates?

Paulo J. S. G. Ferreira and Armando J. Pinho Dep. Electr´onica e Telecomunicac¸o˜ es / IEETA Universidade de Aveiro 3810-193 Aveiro Portugal Tel: +351-234370525, Fax: +351-234370545 E-mails: {pjf,ap}@det.ua.pt EDICS: SPL.SPTM, SPL.IMSP Abstract The performance of state-of-the-art lossless image coding methods (such as JPEG-LS, lossless JPEG-2000, and CALIC) can be considerably improved by a recently introduced preprocessing technique, which can be applied whenever the images have sparse histograms. Bitrate savings of up to 50% have been reported, but so far no theoretical explanation of the fact has been advanced. This letter addresses this issue, and analyzes the effect of the technique in terms of the interplay between histogram packing and the image total variation, emphasizing the lossless JPEG-2000 case.

This work was partially supported by the FCT. 12th June 2002

DRAFT

2


I. I NTRODUCTION It has been shown recently [1] that the performance of lossless image coding methods such as JPEG-LS [2, 3], lossless JPEG-2000 [4, 5], or CALIC [6], can be improved by a simple preprocessing technique, which can be applied to images with sparse histograms. The method is illustrated in Fig. 1. Before coding, the image is subject to a mapping T , which packs its histogram. Applying T −1 to the uncoded image yields the original. Recent research shows that the method is also useful (often even more so) when the histograms are only “locally sparse” [7]. Bitrate savings of 50% and above have been reported [1]. Some results are shown in Table I, for the lossless JPEG-2000 case (see [1] for details). The results for JPEG-LS and CALIC were omitted for brevity but are very similar. The improvements can be surprising, considering that the methods mentioned represent the state-of-the-art. So far, no theoretical reasons for the performance difference have been advanced. We show in this letter that the preprocessing technique can be understood in terms of its effect on the image total variation. Basically, histogram packing introduces a variation-reducing reversible mapping, and images with smaller total variation are easier to compress (in a certain sense, made precise later). Thus, after applying T and reducing the total variation of the images, the subsequent coding step is able to represent the image more efficiently. We show below that the histogram packing, when performed step by step in a certain way, leads to a sequence of images with decreasing total variation. This is particularly relevant for linear or nonlinear transform coding (and hence for JPEG-2000) since the decay rate of the approximation error between an image and a compressed representation is proportional to the image total variation (or to its square, depending on the approximation being linear or nonlinear). Histogram packing, by reducing the image variation, cuts down the approximation error by roughly a constant amount. As a result, the performance of lossless compression methods, which can be viewed as limit cases of the lossy methods, tends to improve in the same proportion. II. H ISTOGRAM PACKING AND VARIATION We claim that histogram packing is a reversible, variation decreasing mapping. In fact, let the image intensity values be I1 < I2 < . . . < In . The total variation of the image f , neglecting normalization factors, is essentially determined by V ( f ) = ∑ | fi j − fi, j−1 |2 + | fi j − fi−1, j |2 i, j

21

.

Without loss of generality, we may assume that I1 = 0, and that we need to pack the (sparse) histogram only on the right. Thus, we are mapping pixels with intensity values In , and there are intensity values below In (say, Im ) that do not occur in the image. We rewrite V ( f ) as V(f) =

∑ S

DRAFT

| fi j − fi, j−1 |2 + | fi j − fi−1, j |2

21

+

12th June 2002

FERREIRA E PINHO: HISTOGRAM PACKING AND LOSSLESS COMPRESSION

+

∑ S

| fi j − fi, j−1 |2 + | fi j − fi−1, j |2

3

12

where S denotes the subset of terms that involve In , and S its complement. We will now examine the effect of replacing In by Im in each of these two sums. Fist, note that replacing In with any other intensity value Im that did not originally occur in the image does not change the sum over S, since none of its terms contain In or Im (In by construction, and Im because it was not even present in the image). Second, note that each of the terms of the sum over S is characterized by having one or more of f i j , fi, j−1 , or fi−1, j equal to In . Neglecting the square roots because they are monotonic functions, each individual nonzero S term can assume one of the following forms: (In − a)2 + (In − b)2 or (In − a)2 + (b − c)2 . where the intensity values a, b, c are below In and Im . Clearly, replacing In by Im will reduce the value of these terms, and consequently the variation. To summarize, each packing step In → Im leaves the S sum invariant, and decreases the S sum. Hence,

it is variation decreasing.

The same reasoning applies when a set of contiguous extreme intensity values are mapped, rather than just In . The conclusion, by induction, is that histogram packing reduces the total variation of the image. To confirm this, refer to Fig. 2, which illustrates the effect of step-by-step packing on the variation of three images, and the corresponding performance of lossless JPEG-2000 (the side information necessary to invert the packing operation has of course been included in the bitrates). III. VARIATION AND APPROXIMATION ERROR Having recognized the link between histogram packing and total variation, let us examine the role of the latter on coding efficiency. We will argue in the general framework of stable, transform-based, possibly nonlinear approximation methods (see [8, Chapter 9], [9]). Recall that in this context “N-term nonlinear approximation” means approximation using the N most significant coefficients of the expansion, whereas “N-term linear approximation” is related to approximation using a fixed, image-independent set of N coefficients. In the univariate case, the L2 linear approximating error is O(N −1 ), but the nonlinear approximation error satisfies En (N) ≤ CV ( f )2

1 . N2

This decay, obtained with wavelet-based approximation, cannot be improved by any nonlinear approximation in an orthonormal basis. In this sense, wavelets provide optimal representations for bounded variation functions [10]. Nevertheless, if a mapping T can be found such that V (T f ) < V ( f ), and the 12th June 2002

DRAFT

4


side-information necessary to compute T −1 does not rule out the coding gain allowed by the decrease in the variation, then coding T f might be advantageous. This is the idea that renders the preprocessing techniques of interest. For an image of bounded variation, the linear approximation error satisfies El (N) ≤ CV ( f )k f k∞

1 , N 1/2

and the nonlinear approximation error is given by 1 En (N) ≤ CV ( f )2 . N In both cases, the error decreases with the variation of the image, or its square. Therefore, any preprocessing technique that reduces the variation (and the value of the largest image intensity) will lead to better approximations. The performance of lossless compression methods, which can be viewed as limit cases of the lossy methods, will tend to improve in the same proportion. This is confirmed in Fig. 3. IV. L OSSY COMPRESSION Now that the effect of histogram packing on a class of lossless coding techniques has been explained, the natural question to ask is: can packing, or any other preprocessing techniques, improve lossy compression methods as well? The answer is, in general, negative. It is in general impossible to reverse the effect of the preprocessing step without introducing errors, the magnitude of which is hard to control. This happens because the packed image cannot be exactly recovered from its compressed version, except in the lossless case. In the lossy case, the effects of the preprocessing stage cannot be exactly reversed. To put it more precisely: in the lossy case, decoding yields an estimate of the packed image. The histogram of the estimate will in general include intensity values that were not present in the original packed image. These values are not in the range of the packing function, which therefore cannot be inverted. Hence, except in the lossless and close-to-lossless range, the histogram packing technique does not appear to be useful. This does not rule out the existence of other reversible transformations T , which when used as shown in Fig. 1 may lead to overall compression gains, as explained above. This and other related issues remain, to the best of our knowledge, open questions. V. C ONCLUSIONS We have discussed a preprocessing technique which can be used in the lossless compression of images with sparse histograms, and which leads to substantial improvements when combined with state-of-theart methods, such as JPEG-LS, lossless JPEG-2000, and CALIC. We have shown that histogram packing is a reversible variation-reducing operation, and that, as a result, it cuts down the approximation error in the class of stable transform-based, possibly-nonlinear image compression methods. These observations provide a theoretical explanation and support for the important compression gains reported recently [1] using the packing/unpacking preprocessing/post-processing technique. DRAFT

12th June 2002


5

R EFERENCES [1] A. J. Pinho, “An on-line pre-processing technique for improving the lossless compression of images with sparse histograms”, IEEE Signal Processing Letters, vol. 9, no. 1, pp. 5–7, Jan. 2002. [2] ISO/IEC 14495–1 and ITU Recommendation T.87, Information technology — Lossless and near-lossless compression of continuous-tone still images, 1999. [3] M. J. Weinberger, G. Seroussi, and G. Sapiro, “The LOCO-I lossless image compression algorithm: principles and standardization into JPEG-LS”, IEEE Trans. on Image Processing, vol. 9, no. 8, pp. 1309–1324, Aug. 2000. [4] ISO/IEC International Standard 15444–1, ITU-T Recommendation T.800, Information technology - JPEG 2000 image coding system, 2000. [5] M. W. Marcellin, M. J. Gormish, A. Bilgin, and M. P. Boliek, “An overview of JPEG-2000”, in Proc. of the Data Compression Conf., DCC-2000, Snowbird, Utah, Mar. 2000, pp. 523–541. [6] X. Wu and N. Memon, “Context-based, adaptive, lossless image coding”, IEEE Trans. on Communications, vol. 45, no. 4, pp. 437–444, Apr. 1997. [7] A. J. Pinho, “Preprocessing techniques for improving the lossless compression of images with quasi-sparse and locally sparse histograms”, Proc. of the IEEE Int. Conf. on Multimedia and Expo, ICME-2002, Lausanne, Switzerland, Aug. 2002. [8] S. G. Mallat, A wavelet tour of signal processing, Academic Press, San Diego, 1998. [9] R. A. DeVore, “Nonlinear approximation”, Acta Numer., pp. 51–150, 1998. [10] R. A. DeVore, B. Jawerth, and B. J. Lucier, “Image compression through wavelet transform coding”, IEEE Trans. Inform. Theory, vol. 38, no. 2, pp. 719–746, Mar. 1992.

12th June 2002

DRAFT

6


Original

Mapping

Lossless

Coded

Image

T

Encoder

Image

Coded

Lossless

Inverse

Original

Decoder

T −1

Image

Image Fig. 1.

Pre-processing (T , histogram packing) and post-processing (T −1 , histogram unpacking) for improved

lossless compression.

TABLE I P ERFORMANCE OF LOSSLESS JPEG-2000 WITH AND WITHOUT HISTOGRAM PACKING . T HE COMPRESSION GAINS INCLUDE THE OVERHEAD DUE TO THE SIDE - INFORMATION NECESSARY TO INVERT THE PACKING TRANSFORMATION .

Image

DRAFT

Number of

No Packing

Packed

Intensities

Size

bps

benjerry

48

14,076

4.027

books

7

43,859

cmpndd

133

cmpndn

132

Size

bps

%

9,664

2.765

31.3

6.164

15,318

2.152

65.1

114,362

2.326

98,767

2.009

13.6

107,596

2.189

92,594

1.883

13.9

flax

3

4,060

1.873

1,044

0.481

74.3

gate

69

32,916

4.323

24,316

3.193

26.1

music

8

8,457

5.491

3,180

2.064

62.4

netscape

27

30,769

4.022

17,887

2.338

41.9

sea dusk

43

8,214

0.417

5,894

0.299

28.2

stone

3

21,374

7.607

5,586

1.988

73.9

sunset

138

119,031

3.099

106,013

2.760

10.9

winaw

10

84,913

2.307

33,757

0.917

60.2

yahoo

156

13,782

4.062

13,001

3.832

5.7

Total

—

603,409

—

427,021

—

29.2

12th June 2002


7

Bits per sample

4

3

2 GATE NETSCAPE WINAW 1

0

2

4

6

8

10

12

14

16

18

20

Image variation

Fig. 2. The effect of histogram packing on the variation of three images, and the corresponding bitrates, obtained using lossless JPEG-2000. The dots mark the packing steps. Each step reduces the variation and the bitrate. The bitrate accounts for the side-information necessary to invert the packing transformation.

1

L2 Error

Normal Packed

0.01

NETSCAPE 1e-4 0.01

0.1

1

Fraction of non-zero coefficients Fig. 3. L2 error as a function of the number of coefficients, for the original and packed image (wavelet-based nonlinear approximation). The compression curves on logarithmic plots are expected to differ by constants in zones where the mentioned asymptotic results are close to exact.

12th June 2002

DRAFT