In accordance with an example embodiment of the present invention, disclosed is a method and an apparatus thereof for removing jitter introduced by a packet switched network. Each received audio frame comprises a primary portion and a redundancy portion. The redundancy portion comprises a partial redundant copy of a previous frame that is offset by k frames. If a frame n is lost, a frame n+k that comprises the partial redundant copy of the lost frame n, is located in a jitter buffer. Based on the frame n+k, a substitute frame n' substituting the lost frame n is created and a substitution indicator of the substitute frame n' is set to indicate that the redundancy portion of the substitute fame n' should be used in decoding.