Digital watermarking is the process of embedding information into a digital signal. The signal may be audio, pictures or video, for example. If the signal is copied, then the information is also carried in the copy. In visible watermarking, the information is visible in the picture or video. Typically, the information is text or a logo which identifies the owner of the media. The image on the right has a visible watermark. When a television broadcaster adds its logo to the corner of transmitted video, this is also a visible watermark. In invisible watermarking, information is added as digital data to audio, picture or video, but it cannot be perceived as such. An important application of invisible watermarking is to copyright protection systems, which are intended to prevent or deter unauthorized copying of digital media. Steganography is an application of digital watermarking, where two parties communicate a secret message embedded in the digital signal. Annotation of digital photographs with descriptive information is another application of invisible watermarking. While some file formats for digital media can contain additional information called metadata, digital watermarking is distinct in that the data is carried in the signal itself. The use of the word of watermarking is derived from the much older notion of placing a visible watermark on paper.
Instance of a Digital Watermarking SchemeA general watermarking scheme is defined as:
where E defines the embedding function, D detecting function, R retrieval function and M the message. Furthermore, the embedding parameters Watermarking Life-Cycle PhasesIn general, the usage of digital watermarking can be simplified as follows. An unmarked (mostly original) signal (S, with The complete scenario is defined as life cycle of a watermark, because it begins with embedding and ends with detection/retrieval. This is shown in the following figure with expected secure and insecure parts. The information to be embedded is called a digital watermark, although in some contexts the phrase digital watermark means the difference between the watermarked signal and the cover signal. The signal where the watermark is to be embedded is called the host signal. A watermarking system is usually divided into three distinct steps, embedding, attack and detection. In embedding, an algorithm accepts the host and the data to be embedded and produces a watermarked signal. The watermarked signal is then transmitted or stored, usually transmitted to another person. If this person makes a modification, this is called an attack. While the modification may not be malicious, the term attack arises from copyright protection application, where pirates attempt to remove the digital watermark through modification. There are many possible modifications, for example, lossy compression of the data, cropping an image or video, or intentionally adding noise. Detection (often called extraction) is an algorithm which is applied to the attacked signal to attempt to extract the watermark from it. If the signal was unmodified during transmission, then the watermark is still present and it can be extracted. In robust watermarking applications, the extraction algorithm should be able to correctly produce the watermark, even if the modifications were strong. In fragile watermarking, the extraction algorithm should fail if any change is made to the signal. Watermark ParametersIn general, the fundamental watermarking parameters are classifies into the 7 watermarking properties capacity, complexity, invertibility, transparency, robustness, security and verification (alphabetic order): CapacityThe Capacity is in general divided into embedding and retrieval capacity. Embedding CapacityThe embedding capacity capE of a watermarking scheme is defined as the amount of information that is (seems to be) embedded into the cover object to obtain the marked object. A simple definition for a capacity measure capE would be related to the size of the embedded message, i.e. capE(Ω * ,S) = size(M) = | M | . In addition, capacity is often given relative to the size of the cover object:
Note that such measure only takes into account the information embedded, but not the information that is retrieved. Note, also, that this measure does not consider the possibility of repeat coding, in which the mark is replicated as many times as needed prior to its insertion. All these issues are related to the retrieval capacity which is defined in the retrieval function. Retrieval CapacityThe definition of retrieval capacity defines the capacity with respect to the retrieved message m'. First of all, zero-bit watermarking schemes do not transmit any message, since the watermark w is just detected but a message m' is not retrieved. In such a case, the retrieval capacity of these schemes is zero. For non zero-bit watermarking schemes the retrieval capacity is considered after data extraction. The following retrieval capacity function is defined: In case of repeat coding, the retrieved message is several times longer than the embedded message: There are two relevant comments about this definition of relative capacity. The first is that usually this kind of measure is given in terms of the size of the cover object S: Another capacity measure can be defined in terms of the ratio of correctly recovered bits normalized by pmax. If pmax is unknown, the measure of ComplexityGiven a function F, the complexity of it can be measured. Thereby the effort or investment needed to embed or attack or detect and retrieve the watermark is defined with complexity. A measuring function C is defined as C(F) to measure the complexity of F. If it is adapted to, for example, the embedding function of Ω, then the embedding complexity can be computed C(E,S). Depending on C, for example the computation cost of time, needed memory or IO operations, lines of code, etc. could be measured. The relative complexity of a watermarking scheme Ω * and a particular object S is defines as:
InvertibilityRefers to the property of a watermarking scheme which has the possibility to remove the watermark w from the marked signal SE completely to receive signal S' and if Ω is invertible, then S = S'. To provide this feature, the watermarking algorithms must provide special embedding techniques. Furthermore, secret keys are mostly used to protect the original content from unauthorized access. The measured value of invertibility for a watermarking scheme Ω * is a boolean value. If this value is 0, then Ω * cannot remove w from the marked object. If Ω can remove w completely and S = S', then 1 is returned. RobustnessIn this section, the robustness of a digital watermarking scheme is described. To introduce the robustness itself, the detection success is needed and introduced as first. Detection SuccessTo measure the overall success of a detection or retrieval function, the detection success function is introduced. Therefore, the connection to zero-bit an n-bit watermarking scheme are introduced as follows. For zero-bit watermarking schemes, detDD returns 0, if the watermark could not be successful detected and 1 if the detection function was able to detect the watermark, see the following equation: Watermark RobustnessThe robustness measure robrel of a watermarking scheme is a value in the closed interval [0,1], where 0 is the worst possible value (the scheme is not robust for the signal S) and 1 is the best possible value (the method is robust for the signal S). There is a difference, for example, depending on whether the bit error rate (BER) or byte error rate (BYR) is used to measure the robustness. If the robustness is measured based on the byte error rate robbyte, then a given watermarking scheme is classified as robust if the bytes of the embedded massage (characters) are correctly retrieved. This measurement is similar to the Levenstein distance, which works and measured a distance between two given strings. It is useful in applications scenarios that need to determine how similar two strings are. Another robustness measure function based on the bit error rate robbit returns the percentage robustness of the watermarking scheme measured over the whole attacking and test set and is based on the bit changes within the retrieved message. This measurement is similar to the Hamming distance based on bit-strings. Hence, a watermarking scheme is classified as not robust, if more than ν numbers of retrieved bits are destroyed and the transparency of the attacks if higher than τ. For zero-bit watermarking schemes no retrieval function exists and no classification based on bit or byte error rates are possible. To simplify matters, the robustness measure for zero-bit watermarking schemes is always classified to robbyte. The following example motivates the distinction between the robustness measure based on bit and byte error rate. If the message m="123", with 3 bytes and 3*8=24 bits, is embedded and after attacking, the last 6 bits are destroyed and incorrectly retrieved, then the byte error rate returns, that 2 bytes are correct (the first two) and one is false (the last), which has a value of The following function relates robustness based on the byte error rate to transparency for a zero-bit and n-bit watermarking scheme as follows, given SEA = Ai,j(SE): And the robustness based on the bit error rate related to the transparency for n-bit watermarking schemes is given as: That is, given a marked object SE and all the attacks which attack the watermark, even for optimal embedding and detection parameters ( The functions measure robustness in a worst case sense. When the security of a system is to be assessed, it is usually considered that a given system is as weak as the weakest of its components. Similarly, the equation establishes that the worst possible attack (in the sense that the mark is erased but the attacked signal preserves good quality) in a given family determines how robust the watermarking scheme Ω is. If the best (maximum) transparency amongst all the attacks which destroy the mark is 0.23, then the robustness of the method as given by is 1 − 0.23 = 0.77. However, the functions of the equation introduced above are \textit{relative} to a given object SEA (hence the use of the subindex "rel") but usually to define the robustness of a watermarking scheme as an inherent property not related to any particular object, but to a family or collection of objects. This may be referred to as the absolute robustness (
SecurityDescribed the security of the embedded watermark against specific security attacks. After defining all required security measurements
TransparencyGiven a reference object Sref and a test object Stest the transparency function T provides a measure of the perceptible distortion between Sref and Stest. Without loss of generality, such a function may take values in the closed interval [0,1] where 0 provides the worst case (the signals Sref and Stest are so different that Stest cannot be recognized as a version of Sref) and 1 is the best case (an observer does not perceive any significant difference between Sref and Stest): ![]() The relative transparency for a watermarking scheme Ω * and a particular object S is defined as: ![]() This definition of transparency is related to a particular object S. It is usually better to provide some absolute value of transparency which is not related to a particular object S. A definition of "absolute" transparency is related to a family / mathbbS of objects to be marked, which applies any of the following definitions: * Average transparency:
VerificationDescribed the type of the detection/retrieval function D,R which requires information. Therefore three classifications are available: Non-blind: If the watermarking scheme requires the cover object S, then it is associated as non-blind watermarking scheme. Often, this type of watermark scheme is referred as informed watermarking scheme. Mostly, the watermark detector/retriever is only useable from a defined group of people, which hides the watermark detector and the required original signal S. Informed: If the watermarking scheme requires the embedded message m, the embedding parameters pE or other additional information (except the original signal S) for detection or retrieval, then the watermarking scheme is associated to this group. Often, watermarking schemes where the embedding function creates a data file needed for detection/retrieval, are associated to this type of verification. Blind: If the watermarking scheme does not require the original signal nor additional information (e.g. m or pE), then the watermarking scheme is associated to this group. The verification (ver) is defined as list {0,0.5,1}, whereby the 1 is associated with non-blind, a 0.5 with informed and a 0 with blind. The formalization is introduced in the following equation. ClassificationA digital watermark is called robust with respect to a class of transformations T if the embedded information can reliably be detected from the marked signal even if degraded by any transformation in T. Typical image degradations are JPEG compression, rotation, cropping, additive noise and quantization. For video content temporal modifications and MPEG compression are often added to this list. A watermark is called imperceptible if the cover signal and marked signal are indistinguishable with respect to an appropriate perceptual metric. In general it is easy to create robust watermarks or imperceptible watermarks, but the creation of robust and imperceptible watermarks has proven to be quite challenging [1]. Robust imperceptible watermarks have been proposed as tool for the protection of digital content, for example as an embedded 'no-copy-allowed' flag in professional video content [2]. Digital watermarking techniques can be classified in several ways. RobustnessA watermark is called fragile if it fails to be detected after the slightest modification. Fragile watermarks are commonly used for tamper detection (integrity proof). Modification to an original work that are clearly noticeable are commonly not referred to as watermarks, but referred to as generalized barcodes. A watermark is called semi-fragile if it resist benign transformations but fails detection after malignant transformations. Semi-fragile watermarks are commonly used to detect malignant transformations. A watermark is called robust if it resists a designated class of transformations. Robust watermarks are commonly used in copyright applications (to carry ownership or forensic information) and copy protection applications (to carry copy and access control information). PerceptibilityA watermark is called imperceptible if the original cover signal and the marked signal are (close to) perceptually indistinguishable. A watermark is called perceptible if its presence in the marked signal is noticeable, but non-intrusive. CapacityThe length of the embedded message | m | determines two different main classes of watermarking schemes:
Embedding methodA watermarking method is referred to as spread-spectrum if the marked signal is obtained by an additive modification. Spread-spectrum watermarks are known to be modestly robust, but also to have a low information capacity due to host interference. A watermarking method is referred to be of quantization type if the marked signal is obtained by quantization. Quantization watermarks suffer from low robustness, but have a high information capacity due to rejection of host interference. A watermarking method is referred to as amplitude modulation if the marked signal is embedded by additive modification method which it similar to spread spectrum method but this method is especially embedded in spatial domain. ApplicationsDigital Watermarking can be used for a wide range of applications such as:
Evaluation / BenchmarkingThe evaluation of digital watermarking schemes can provide detailed information for watermark designer or end users. Therefore, different evaluation strategies exists. Often used from watermark designer is the evaluation of single properties to show, for example, an improvement. End users, are mostly not interested in detailed information. They want to know, if a given digital watermarking algorithm can be used for their application scenario, and if yes, which parameter sets seems to be the best. See alsoExternal links
References
| |