DataСompression.info

data compression link collection

Speech

Compression of human speech is a specialized form of audio compression. Greater levels of compression can be achieved by modeling the human vocal tract, and taking advantage of redundancy in human speech.


OpenLPC Codec

A low bitrate codec, described as being derived from the work of Ron Frederick. Freeware.

ftp://ftp.futuredynamics.com/OpenLPC/

* * * *  

Posted in May 8th, 2001

HawkVoice Speech Samples

Published in Files, Speech

Some speech samples that have been encoded at various rates using various codecs. If you’ve never heard speech encoded at 1.4 Kbps, here’s a chance to check it out.

http://www.hawksoft.com/hawkvoice/codecs.shtml

         

Posted in May 8th, 2001

Hawk Software HawkVoice

HawkVoice is a game oriented, multiplayer voice over network API released under the GNU Library General Public License (LGPL), with
support for Linux/Unix and Windows 9x/ME/NT/2000. It is designed to be a portable, open source code alternative to DirectPlay(R) Voice in DX8.

http://www.hawksoft.com/hawkvoice/

* * * * *

Posted in March 9th, 2001

Digital Speech Compression - Putting the GSM 06.10 RPE-LTP algorithm to work

An article by Jutta Degener describing implementation of this GSM codec.

http://www.ddj.com/articles/1994/9412/9412b/9412b.htm

         

Posted in October 22nd, 2000

What is mu-law encoding? Where can I get source for it?

Published in Newsgroup Posts, Speech

Part of the DSP FAQ, in our case it is an answer to a speech coding question.

http://www.hr/josip/DSP/FAQ/27.html

         

Posted in September 24th, 2000

TrueSpeech

The DSP Group makes a codec called TrueSpeech. You can get a copy of the player for free here to check out the 8.5 KBps audio it makes possible.

http://www.dspg.com/dspg/truespeech_compression.html

* *      

Posted in September 24th, 2000

SpeakFreely

Published in Source Code, Speech

This freeware program is designed to perform Internet telephony. It incorporates source for a couple of interesting speech codec, which is why it gets a link in the library.

http://www.speakfreely.org/

* * *    

Posted in September 24th, 2000

IstraSoft

IstraSoft appears to make some speech codecs, supplied as Windows DLLs. Plus some discussion on their web page re: English phonetics.

http://www.istrasoft.com/

         

Posted in August 27th, 2000

Standard Speech Codecs

Descriptions of various speech codecs, include G.711, G.721, GSM, and CELP. Each codec gets a brief description plus pointers to additional material and source code.

http://rice.ecs.soton.ac.uk/jason/speech_codecs/standards/index.html

*        

Posted in August 20th, 2000

Speech Compression

A page on speech compression by Nam Phamd. Concentrates on LPC and CELP.

http://www.data-compression.com/speech.html

* * * *  

Posted in July 29th, 2000

LPC-10 speech coder software

Published in Source Code, Speech

This is supposed to be a 2.4 Kbps LPC speech codec.

http://www.arl.wustl.edu/~jaf/lpc/

* * * *  

Posted in July 29th, 2000

13 kbps GSM speech coder.

Published in Source Code, Speech

This is reputed to be C source code for a GSM codec.

ftp://ftp.cs.tu-berlin.de/pub/local/kbs/tubmik/gsm/gsm-1.0.10.tar.gz

* * * *  

Posted in July 29th, 2000

Digital Processing of Speech Signals

Published in Books, Speech

by Lawrence R. Rabiner, Ronald W. Schafer. A basic overview of how to use DSP to deal with speech.

http://www.amazon.com/exec/obidos/ASIN/0132136031/theinternetdatac

         

Posted in July 29th, 2000

Computer Speech : Recognition, Compression, Synthesis

Published in Books, Speech

by Manfred R. Schroeder. How about a customer review of this book?

http://www.amazon.com/exec/obidos/ASIN/3540643974/theinternetdatac

* * * * *

Posted in July 29th, 2000

Very Low bit rate speech coding

A project at Cambridge University that led to a low bit rate speech coder that was used in the HP 620LX Palmtop PC.

http://svr-www.eng.cam.ac.uk/~ajr/speechCoding.html

* *      

Posted in July 15th, 2000

GT Technology

This site advertises a bunch of nifty looking codecs, including TELP, JPEG, Speech, MPEG, and wavelet compressors. I can’t quite determine exactly what form the products come in.

http://www.gt-tech.com/technology.html

         

Posted in June 3rd, 2000

Nuera Products

Published in Hardware, Speech

Nuera makes a couple of speech multiplexers that use CELP compression. You probably won’t be interested in these unless you’ve got a T1 line that is bursting at the seams.

http://www.nuera.com/products/AP200_100.cfm

* * *    

Posted in June 3rd, 2000

ITU Series G Specifications

All the voice coding specs you can shake a stick at. Get your checkbook out, though, these aren’t free.

http://www.itu.int/itudoc/itu-t/rec/g/g700-799/index.html

         

Posted in April 9th, 2000

Sipro Lab Telecom

These folks make several different G.729 and G.723.1 codecs,

http://www.sipro.com/

* * * *  

Posted in January 23rd, 2000

VoiceAge Corporation

Our technology served as the foundation for numerous voice compression products used by voice portal and large multinationals in their efforts to better meet the quality requirements that end users. As example voice email, voice chatting, voice annotation in web site. Our technology is already integrated into software by companies such as Microsoft with its Netshow® software and in its Mediaplayer®, and RealNetworks within its RealAudio® and RealVideo®. In addition, over the last decade this compression technology (over 100 patents) was positioned in several International Telecommunication standards, notably: GSM ,PCS, TDMA ,CDMA, TETRA, G.729 . VoiceAge’s compression technology is well positioned at cross road of internet and voice for next-generation of voice portal.

http://www.voiceage.com

* * * * *

Posted in January 23rd, 2000

Interpolative Coding at 2.8 kbps

These folks at UCSB are encoding speech at 2.8 Kbps. It sounds very good considering the bit rate. Links here to a presentation and an abstract, as well as some samples.

http://scl.ece.ucsb.edu/current/oded/ewi_demo.htm

         

Posted in January 13th, 2000

Cysip DSP Courses

These folks offer some seminars on communcations. On their page, if you go to the links to free software, you will find Matlab code for CELP and LPC Vocoders. This same page also has a wide variety of links for speech coding stuff.

http://www.cysip.com/

* * * * *

Posted in December 31st, 1999

comp.speech FAQ

A web site containing the comp.speech FAQ. In particular, you will be interested in section 3 for this FAQ, which discusses speech coding and compression.

http://svr-www.eng.cam.ac.uk/comp.speech/

* * * *  

Posted in December 31st, 1999

GSM 06.10 lossy speech compression

GSM provides telephone quality speech at a compressed rate of 13 Kbps. Compare this to the 64 KBps required by standard u-law and A-law codes. This site gives lots of info about the GSM format, along with free source code.

http://kbs.cs.tu-berlin.de/~jutta/toast.html

* * * *  

Posted in November 21st, 1999

Digital Voice Systems Inc.

This company makes the AMBE-2000, a Vocoder chip that compresses speech down to a miniscule 4 Kbps. They have an additional product library of low bit-rate codes both in hardware and software.

http://www.dvsinc.com/

* * * *  

Posted in November 21st, 1999

comp.speech WWW site

This site contains the comp.speech FAQ, and also has links to their ftp site, which contains software for speech codecs.

http://www.speech.cs.cmu.edu/comp.speech/

* * * * *

Posted in November 21st, 1999

SHORTEN: Simple lossless and near-lossless waveform compression

Tony Robinson - Technical report CUED/F-INFENG/TR.156. A report on an audio compression algorithm that relies on compression of the waveform with Huffman compression of the residuals.

http://svr-www.eng.cam.ac.uk/~ajr/tr156/tr156.html

         

Posted in November 14th, 1999

Lossless Compression of 3D MRI Brain Images

Typical clinical MRI scanners generate data on the order of about 200 MB each day. Because this data provides important diagnostic information, care must be taken in compressing it, and lossy compression is generally avoided. In this project we investigate different techniques for lossless compression of 3-dimensional MRI scans of the brain. These methods include predictive coding, “model coding” using a reference data set, background suppression, symmetry coding and pyramid coding


Matlab source code is included along with the full text of the paper.

http://www-ise.stanford.edu/class/ee392c/demos/hargreaves_johanson_nayak/

* * * * ½

Posted in November 7th, 1999

Speech Coding

Published in Courses/Seminars, Speech

What appears to be a course or lecture outline on speech coding.

http://svr-www.eng.cam.ac.uk/~ajr/SA95/node78.html

* *      

Posted in November 2nd, 1999