Playing samples on the PSG
Latest updates:
2005-06-01
Added information about the PSG’s logarithmic volume scale, and how to play 8-bit samples on the PSG. Thanks go to Arturo Ragozini for pointing this out.
I wrote a question about playing samples on the PSG to the MSX Mailinglist once, and Ricardo Bittencourt replied with an explanation of various techniques, which I have quoted below. After that the article continues, describing the logarithmic nature of the PSG volume, and how to utilise it to achieve 8-bit samples on the PSG.
Table of contents:
Ricardo’s explanation
From: Ricardo Bittencourt Vidigal Leitao
To: MSX Mailinglist
Date: monday, june 22nd 1998, 18:58
Subject: Re: A question about the PSG & samplesOn Mon, 22 Jun 1998, Laurens Holst wrote:
=== How to replay samples via the PSG??? ===
You have to use an undocumented feature of the PSG. When you select a period of 0 in the register 6, the noise produced is just like the noise when you select 1 in the same register. But this does not apply to the square wave generators. When you write a 0 to registers 0 and 1, what’s happening is that you TURN OFF THE OSCILLATORS. Since the PSG uses active low logic, the signal on the output is set to “1” and doesn’t change with time. Now comes the trick. This “1” is affected by the register 8 (volume register). This way, if you change the value of register 8 very quickly, you can modulate the output and generate a nice 4-bit PCM. This method is used in the game “Aleste 2”.
I made a program to test this feature, it’s called “readwav” and it can play .WAV files of 11 kHz. It can be found at http://www.lsi.usp.br/~ricardo/msx.htm (note: this page apparantly doesn’t exist anymore, it has moved, but I can’t find the program on the new page). The maximum wave size is about 50kb but Walter “Marujo” made a new version that plays files up to 100kb, I’ll try to add it to the home page as soon as possible. There is no source code included, but my original assembly source doesn’t have any comments, the best you can do is disassemble it by hand (it has only 512 bytes anyway). Oh, this 11 kHz is arbitrary; by removing a void-loop in the middle of the code you can reach up to 35 kHz.
Please note this is not the only way to generate samples on MSX1. You can also use the keyclick to generate 1-bit PCM (used in the game “Super Laydock” for example).
Another undocumented feature of the PSG permits a variation on the first method. Most people think the lower bits of register 7 are used to set the volume of a channel to zero. This is not true, the bit actually controls the oscillator. So, by disabling a channel in register 7, you can write any values you want in registers 1 and 0 and still use the initial method (btw, this is used in “Oh Shit”).
A last method is to select a sound with a very low frequency, and change the volume faster than this frequency. This method is not used in any MSX game that I know, since it is very inaccurate. But it’s used a lot in the Sega Master System, in games like “Afterburner”. The Sega Master System, for those who don’t know, is a videogame system heavily based on the MSX. It uses a Z80 and a sound chip with the same characteristics as the AY-3-8910 (but it doesn’t have envelopes).
- Ricardo Bittencourt
Note that the example Ricardo gives uses processor-dependant timing. I myself wrote a sample player for a printer port DAC (‘SiMPL’) once, which ‘calibrated’ itself first before playing a sample, making it work independantly of the processor’s speed. I basically did that by playing a ‘silent’ sample a number of times at different rates during fixed intervals, and measuring at what rate it could fit the desired number of samples within the interval. Actually it was funnier to use a nonsilent sample (a sine wave instead) ^_^.
Logarithmic volume scale
One thing that this email fails to mention is that the volume control of the PSG operates on a logarithmic scale. Every 2 volume decrements cut the output volume in half. Because of this, you cannot just take the most significant 4 bits of an existing sample and send it to the PSG. Instead, you have to map the values to a logarithmic scale. The formula for this scale is:
Formula (MathML) | Formula (non-MathML) |
---|---|
y = 2^-((15-n)/2) |
Where is the volume, a value from 0 … 15. There is one exception: 0. When the volume is zero, so is the output.
This corresponds to the following table:
PSG volume | DAC output | 8-bit sample |
---|---|---|
0 | 0 | 0 |
1 | 0,0078125 | 2 |
2 | 0,0110485 | 3 |
3 | 0,015625 | 4 |
4 | 0,0220971 | 6 |
5 | 0,03125 | 8 |
6 | 0,0441942 | 11 |
7 | 0,0625 | 16 |
8 | 0,0883883 | 23 |
9 | 0,125 | 32 |
10 | 0,1767767 | 45 |
11 | 0,25 | 64 |
12 | 0,3535534 | 90 |
13 | 0,5 | 128 |
14 | 0,7071068 | 180 |
15 | 1 | 255 |
Because of this logarithmic scale, I’d say the actual sample quality of the PSG, even though 4-bits, is really comparable to a 3- or perhaps even 2-bit linear DAC.
Thanks go to Arturo Ragozini for pointing this out.
Playing better samples on the PSG
However, that same logarithmic scale can also work to our advantage. The PSG mixer in MSX machines is simple, the channels are simply added together, which allows us to combine the power of the three channels. By doing that, we can compensate for the lack of precision at e.g. the 0.5 output value of channel 1 (volume 13) by using the lower ranges of channel 2! For example, if you would want to output 0.51, you would generate an additional output of 0.01 on channel 2 (volume 2) which is then added to the first channel. When combining three channels this way, we can get 608 discrete sample values!
608 values, that’s about 9 bits of sample information. Not too shabby :). Note that the values are not evenly spaced apart so most of them will have deviations compared to a linear scale, ranging from small ones in the lower ranges to bigger ones in the top ranges. Because of that, the top range is much less useful, but it would probably distort anyway because of the high amplitude.
I created a C# program to calculate what combinations of volumes would have to be used, and to brute force check over what range the deviations would be the least. Without boring you with all the specific details, I have found that when running over a range from 0 to 1.328 (out of 3), the signal-to-noise ratio is the best. The resulting lookup table for that range, mapping 8-bit sample values to PSG channel volumes, is included in the replay routine below:
; ;PSG sample replay routine ; ;hl = sample start address ;de = sample length ; exx ld c,#A1 ld d,0 exx Loop: ld a,(hl) inc hl exx ld e,a ld hl,PSG_SAMPLE_TABLE add hl,de ld b,(hl) inc h ld e,(hl) inc h ld h,(hl) ld a,8 out (#A0),a ;play as fast as possible inc a out (c),b out (#A0),a out (c),e inc a out (#A0),a out (c),h ld b,8 ;timing wait loop WaitLoop: djnz WaitLoop exx dec de ld a,d or e jp nz,Loop ret PSG_SAMPLE_TABLE: DB 00,01,02,03,04,03,05,03,04,05,06,06,05,06,06,06 DB 06,06,07,06,07,08,08,08,07,07,09,07,09,09,08,08 DB 09,09,08,09,09,09,09,09,10,10,10,10,09,09,10,10 DB 10,10,09,10,11,11,11,11,11,11,11,11,10,10,10,11 DB 11,11,11,11,11,11,11,12,11,11,12,12,11,12,11,12 DB 12,12,12,11,12,11,12,12,12,12,11,12,12,12,12,11 DB 12,13,12,13,11,13,13,13,13,13,13,11,13,13,13,13 DB 13,13,13,12,13,13,13,12,12,13,12,13,13,13,13,13 DB 13,12,13,13,13,13,13,13,13,14,13,13,14,14,14,14 DB 14,14,13,14,14,13,14,14,14,14,14,14,13,14,14,14 DB 14,14,14,13,14,14,13,14,14,13,13,14,14,14,14,14 DB 14,14,14,14,13,14,14,13,14,14,14,14,14,14,13,14 DB 14,14,15,14,15,15,15,15,15,15,15,15,15,15,15,15 DB 14,15,15,15,15,15,15,14,15,15,15,15,15,15,15,15 DB 15,15,15,15,15,15,15,15,15,15,15,14,15,14,14,14 DB 14,14,15,15,14,15,15,14,15,15,15,15,15,15,15,14 DB 00,00,00,00,00,02,00,02,02,03,01,02,04,04,03,04 DB 04,05,04,05,05,02,03,04,06,06,01,06,02,03,06,07 DB 05,06,07,06,06,06,07,06,04,04,05,06,08,07,06,06 DB 07,06,08,07,03,04,03,04,04,05,05,05,08,09,09,07 DB 07,07,08,07,08,08,08,02,08,09,03,05,09,05,08,06 DB 06,07,06,10,07,09,08,07,08,08,09,08,08,09,08,10 DB 09,00,08,01,10,02,03,04,04,05,06,10,06,06,06,07 DB 06,07,07,10,08,08,07,11,11,08,11,08,09,09,09,08 DB 09,11,09,09,10,10,10,10,10,00,10,09,02,02,04,03 DB 04,04,11,05,05,11,07,07,07,07,07,08,10,08,08,08 DB 08,08,09,11,09,09,12,08,09,12,11,09,10,10,09,10 DB 10,10,10,09,11,10,10,12,10,10,11,11,11,10,12,11 DB 11,11,00,11,01,02,03,04,03,04,04,05,05,05,06,07 DB 12,07,07,07,08,07,08,12,08,08,08,09,08,09,09,09 DB 08,09,09,09,09,10,10,09,10,10,10,13,09,13,13,13 DB 13,13,10,11,13,11,10,13,11,11,11,11,11,10,10,12 DB 00,00,00,00,00,00,00,01,01,00,00,00,01,00,02,02 DB 03,02,01,04,01,01,01,01,03,04,00,05,01,01,04,01 DB 01,00,04,02,03,04,01,05,01,02,01,00,02,06,03,04 DB 01,05,06,04,00,00,02,02,03,02,03,04,06,02,03,02 DB 03,04,00,05,02,03,04,00,05,00,02,00,03,02,07,01 DB 02,00,04,00,03,07,00,05,02,03,08,04,05,00,06,07 DB 03,00,07,00,08,01,01,01,02,01,00,09,02,03,04,01 DB 05,03,04,07,01,02,06,01,02,05,04,06,02,03,04,07 DB 05,07,06,06,00,01,02,03,04,00,05,08,00,01,00,02 DB 02,03,00,03,04,03,00,01,02,03,04,00,09,02,03,04 DB 04,05,00,08,02,03,00,07,05,03,09,06,00,01,07,03 DB 04,04,05,08,10,06,06,08,07,07,00,00,01,08,09,04 DB 05,05,00,06,00,00,00,00,02,02,03,02,03,04,03,00 DB 01,02,03,04,00,05,02,06,04,04,05,00,06,02,03,04 DB 07,05,05,06,06,00,01,07,03,04,04,00,08,02,03,04 DB 04,05,07,00,06,01,08,07,04,05,05,06,06,09,09,11
In these values, each block of bytes corresponds to a PSG channel, and each index from 0…255 in one of these blocks bytes corresponds to a sample value point. The total maximum volume you should be able to generate with these is about 30% more than a single channel outputting at maximum volume. The speed can be optimized a little more by aligning the table to a multiple of 256 address, if you need it.
That should give you enough of a start to create nice PSG samples. For completeness’ sake, here is a link to download the C# program I created to find the optimal range with the least amount of errors: PSG_Sample.cs. If you want to output at a different volume, you can recalculate the table with it.
Again, thanks go to Arturo Ragozini for thinking this through.