Programmable real-time audio signal processing application

notes.txt 21KB

  1. CONTENTS:
  2. 1) TODO List
  3. 2) OSX Install Notes
  4. SPAT LAB SETUP:
  5. + 1. Stereo
  6. a. TotalMix - select 'diff. stereo' snapshot
  7. b. mixer: Recall preset 91 ('stereo diff')
  8. + 2. Direct - 24 channels - each channel goes in and out on same number.
  9. a. TotalMix - select 'DTS_24ch'
  10. b. mixer: Recall preset 96 ('24ch firecac ufx')
  11. LANGUAGE NOTES:
  12. The primary goal of the language is to initialize a dataflow system.
  13. Sub-goals:
  14. 1) Avoid describing domain specific computation in the language. As much as possible constrain the
  15. language to describe initialization tasks (e.g. object allocation, object connection, initialization
  16. parameters, preset grouping, thread/process allocation, UI layout, ...)
  17. 2) Network Distribution
  18. 3) UI Layout
  19. Runtime:
  20. + Extensible data-object system.
  21. + Library organization (application objects, simple objects, vector library)
  22. NEW DESIGN NOTES:
  23. + The current design's fundamental weakness is that it uses callbacks to communicate where sequential program
  24. flow would produce a more comprehensible program.
  25. + Dataflow is a natural way to express DSP programs but if it is implemented in a textual, rather than
  26. graphical language, then there are some design principles that must be followed in order
  27. to produce comprehensible programs.
  28. 1) Limit the number of connections as much as possible.
  29. - There are many simple UI->param connections - these should be made automatically and UI
  30. objects should not have to be explicitly created - they should be created by the runtime
  31. environment.
  32. - If multiple pieces of data are part of a single message then they should be sent as a unit
  33. rather than separately. This decreases the number of connections and also removes timing
  34. dependencies - where the application programmer has to know the order of transmission of the
  35. individual pieces. (e.g. MIDI messages always contain {status,d0,d1}. Rather than having to
  36. send d0 and d1 followed by status to indicate the end of the message, {status,d0,d1} should
  37. be sent as a single record.)
  38. 2) Eliminate as many event ordering issues as possible
  39. See the example in 1).
  40. 3) Allow the connections between objects to be made as part of the object allocation.
  41. 4) Allow subprograms to be made. This decreases the complexity of the programs because
  42. it allows the programs to be organized hierarchically. It also allows the subprograms to
  43. be tested independently.
  44. 5) A natural way to express object multiplicity is required. (e.g. multiple channels).
  45. This leads to a way to naturally create parallel/fan-in/fan-out connections.
  46. + One way to solve some of the problems of the current program would be to add debugging tools.
  47. - Generate dataflow diagrams that show execution order and the order of connections.
  48. The actual sending order of the outputs is not accessible to a static network analyzer because it is
  49. implementation dependent.
  50. - Generate reports of network activity that show the timed order of events.
  51. + The audio system (cmAudioSys) needs to be able to support multiple parallel DSP chains in separate threads.
  52. + UI Related Issues:
  53. - Automatic UIs should be generated by scanning the object's parameters.
  54. - Custom UIs should be created by explicitly naming object variables along with layout info.
  55. + Processors should be able to contain their own processing chains - embedding processors should be possible.
  56. This naturally leads to a tree address space. (e.g. fx.filter.param1)
  57. + Communication between threads should not necessarily require serialization.
  58. - Use a blackboard approach where object values are read/written from a blackboard. Objects on the same
  59. thread use the same physical blackboard. Blackboards on remote processes stream data in the background.
  60. - This scheme may require double buffering of complex objects to prevent accessing invalid data states.
  61. + Objects that send multiple valued messages should use 'record' based data so that only one connection
  62. is necessary.
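
The 'record' idea above can be sketched in C. This is only an illustration: the names midiRecd_t, midiConn_t, midiSend() and countNoteOn() are hypothetical and are not part of the existing code. The point is that one connection carries the whole {status,d0,d1} message, so the receiver never depends on the order in which separate fields arrive.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical record type: one complete MIDI channel message. */
typedef struct
{
  uint8_t status;  /* status byte (e.g. 0x90 = note-on, channel 0) */
  uint8_t d0;      /* first data byte (e.g. pitch)                 */
  uint8_t d1;      /* second data byte (e.g. velocity)             */
} midiRecd_t;

/* Hypothetical receiver callback: gets the whole record in one call. */
typedef void (*midiRecdFunc_t)( void* arg, const midiRecd_t* r );

/* A single connection: one callback carries all three fields. */
typedef struct
{
  midiRecdFunc_t func;
  void*          arg;
} midiConn_t;

/* Send a complete message over one connection. */
void midiSend( const midiConn_t* c, uint8_t status, uint8_t d0, uint8_t d1 )
{
  assert( c != NULL && c->func != NULL );
  midiRecd_t r = { status, d0, d1 };
  c->func( c->arg, &r );
}

/* Example receiver: counts note-on messages with non-zero velocity. */
void countNoteOn( void* arg, const midiRecd_t* r )
{
  unsigned* cnt = (unsigned*)arg;
  if( (r->status & 0xF0) == 0x90 && r->d1 > 0 )
    ++(*cnt);
}
```

With three separate connections the receiver would have to know which field arrives last; here a single call to midiSend() delivers a consistent message.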
  63. STRATEGY:
  64. + implement highly parallel version - which can take advantage of multiple processors
  65. to do more - acoustic pitch tracking, source separation based fades
  66. TODO:
  67. + When a cmMsgList item is selected it should send out the selected index
  68. after the individual data items rather than before them - this way the index
  69. can be used as a definitive signal to perform some action on the previously
  70. transmitted data items.
  71. + cmJson and cmLex should report the name of the file with syntax error messages.
  72. + Use CSV for mod score file format (to eliminate need for labels)
  73. + Autoload of default pgm cfg, program, enable audio, sequence, score location.
  74. + (done) Circuit switcher patch.
  75. + Review and document the app threading and locking during audio file loading.
  76. + Remove old performance evaluation code from cmScore.
  77. + All programs should be able to reload without crashing via enabling/disabling audio.
  78. + (in new version) Select and save audio devices.
  79. + (done) Mac Conversion
  80. + (done) Live MIDI - to test audio/MIDI delay.
  81. + (done) As threshold increases overall volume decreases - add a compensation algorithm.
  82. + There are uninitialized variable warnings on the release build.
  83. + Add preset delete.
  84. + All uses of va_copy() should have a complementary va_end()
  85. + The list boxes do not show the currently selected item.
  86. + (done) MIDI note messages are sent but do not trigger notes on OSX.
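
As a reminder of the va_copy()/va_end() pairing mentioned in the TODO list, here is a minimal sketch. fmtAllocV() and fmtAlloc() are hypothetical helpers written for this note, not functions from the existing libraries. The argument list is walked twice, so a copy is needed, and the copy gets its own va_end().

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <stdlib.h>

/* Format into a newly allocated string. The argument list is traversed
   twice (once to measure, once to write), so a copy is required, and
   that copy gets its own va_end(). The caller frees the result. */
char* fmtAllocV( const char* fmt, va_list vl )
{
  assert( fmt != NULL );

  va_list vl2;
  va_copy( vl2, vl );                        /* copy before vl is consumed */

  int   n = vsnprintf( NULL, 0, fmt, vl );   /* pass 1: measure */
  char* s = NULL;

  if( n >= 0 && (s = malloc( (size_t)n + 1 )) != NULL )
    vsnprintf( s, (size_t)n + 1, fmt, vl2 ); /* pass 2: write */

  va_end( vl2 );                             /* matches the va_copy() above */
  return s;
}

char* fmtAlloc( const char* fmt, ... )
{
  va_list vl;
  va_start( vl, fmt );
  char* s = fmtAllocV( fmt, vl );
  va_end( vl );                              /* matches va_start() */
  return s;
}
```

Note that va_end( vl2 ) is reached on every path through fmtAllocV(), including the allocation-failure path.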
  87. *** Usage Notes
  88. live - Disconnects WT cmd input (WT will not receive an 'on' msg)
  89. Disconnects TL reset input (TL will not receive a 'reset' msg)
  90. Disconnects MFP sel input (MFP will not receive an 'on' msg)
  91. Switches audio input to KR from WT to AIN.
  92. Turn the 'meas' checkbox 'on'.
  93. simulate - Turn the 'audio in' checkbox 'on'.
  94. Switch MFP output from SF to Nano.
  95. (SF MIDI input then comes from the MIDI port.)
  96. audio in - Switch audio input to KR from WT to AIN.
  97. meas - Instruct the SF to generate measurement outputs for the active meas. unit.
  98. Otherwise the measurements must be loaded from the recorded measurement list.
  99. print - Print a report from the SF.
  100. quiet - Turn off SF output.
  101. LA Secs - Fragment recd/play unit look-ahead time in seconds.
  102. Fade DbpSec - Fragment recd/play unit fade out time
  103. (time to fade to 0 following a fade msg)
  104. *** Cross-fade Notes
  105. Cross-fades are initiated by sending any msg to the 'AvailCh.trig' input.
  106. The 'AvailCh' object then toggles the parameter router channels and xfader gates.
  107. Only after this should the new parameter values be transmitted either from
  108. the 'ActiveMeas' object (through the scale range chains) or directly from the
  109. 'ScMod' object.
  110. Sending parameter values prior to triggering the 'AvailCh' will result in the
  111. parameters being sent to the currently active 'Kr'. This will result in two
  112. possibly unintended effects:
  113. 1) The effect of the parameter will be heard immediately - possibly resulting in distortion.
  114. 2) If a subsequent trigger is sent to 'AvailCh' the parameters will be routed to
  115. the fade-out (current) channel rather than the fade-in (next) channel.
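
The ordering rule above can be illustrated with a toy model. This is a sketch under stated assumptions: xfadeModel_t and its functions are hypothetical and only mimic the routing behaviour described in these notes (two channels, one router index, one gate per channel). They are not the actual 'AvailCh' or 'Kr' implementation.

```c
#include <assert.h>
#include <stddef.h>

enum { kChCnt = 2 };

/* Hypothetical model of the parameter router and cross-fader gates. */
typedef struct
{
  int    routeCh;          /* channel that currently receives parameter msgs */
  double param[ kChCnt ];  /* last parameter value seen by each channel      */
  int    gate[ kChCnt ];   /* 1 = fading in / audible, 0 = fading out        */
} xfadeModel_t;

void xfadeInit( xfadeModel_t* m )
{
  assert( m != NULL );
  m->routeCh = 0;
  for(int i=0; i<kChCnt; ++i)
  {
    m->param[i] = 0.0;
    m->gate[i]  = (i == 0);
  }
}

/* Models a msg arriving at 'AvailCh.trig': the current channel starts
   fading out and the router and gate switch to the next channel. */
void xfadeTrigger( xfadeModel_t* m )
{
  m->gate[ m->routeCh ] = 0;
  m->routeCh            = (m->routeCh + 1) % kChCnt;
  m->gate[ m->routeCh ] = 1;
}

/* Models a parameter msg: it always lands on the routed channel. */
void xfadeSendParam( xfadeModel_t* m, double v )
{
  m->param[ m->routeCh ] = v;
}
```

Calling xfadeTrigger() and then xfadeSendParam() puts the value on the fade-in channel. Reversing the two calls puts it on the channel that is still audible and is about to fade out, which is the unintended case described above.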
  116. **** Live Test Score
  117. + Line 1048 has a red G#5 immediately following another G#5. Is this correct?
  118. For now the second G#5 has been marked as a 'skip'.
  119. + Changed Tempo sections 25,26,27 to 40,47,47a
  120. + Measurements are taken for sections 51-54 but these sections follow bar 136 and
  121. are therefore outside the test. These sections have therefore been redirected
  122. to the downbeat of 201-204.
  123. *** Testing Notes:
  124. + Equipment List:
  125. Four Microphones: Four performance / Four recording
  126.                   2 inside           2 inside
  127.                   2 overhead         2 overhead
  128. 4 powered speakers
  129. 2 Mixers (1 performance 1 recording)
  130. Performance Computer (harpo)/ Audio Interface (delta1010) / MIDI interface (Fastlane)
  131. Recording Computer (crel) / Audio Interface (delta 1010) / MIDI interface (???)
  132. Sensor Strip
  133. + Performance Setup
  134.                  +-------+
  135. Mic0 ----------->|       |
  136. Mic1 ----------->|       | sends  +-----+       +-------+
  137. Mic2 ----------->|       |------->| A/D |       |       |        +------+      +--------+
  138. Mic3 ----------->| Mixer |------->|     |------>| harpo |<-------| MIDI |<-----| sensor |
  139.              aux |       |        |     |       |       |        +------+      | strip  |
  140. Spkr0 <----------|       |<-------|     |<------|       |                      +--------+
  141. Spkr1 <----------|       |<-------|     |       +-------+
  142.             main |       |        |     |
  143. Spkr2 <----------|       |<-------| D/A |
  144. Spkr3 <----------|       |<-------|     |
  145.                  +-------+        +-----+
  146. + Software Development
  147. - Create Score File
  148. - Create Recording Program (test with long MIDI playback generating audio - look for drift)
  149. Record the index of each MIDI event at its location in an audio channel.
  150. - Allow all variables and patch connections to be set from the scMod script and have
  151. multiple scripts with varying effects setups.
  152. - When scanning past ramping variables in scMod the end value should be
  153. taken as the next variable(???) - this is not necessarily correct
  154. because one never knows where a timed change may end - maybe ramped
  155. variables should include a 'skip value' giving the next ambient value
  156. for the ramped variable - experiment with this to figure out what works.
  157. - Add comb filters tuned according to the current MIDI notes as an additional effect.
  158. - Add EQ output stage (use mixer).
  159. - Add an input Compressor.
  160. - The dry signal should be able to be routed to separate output channels - around the compressor.
  161. (Better would be to output a delayed version of the dry signal that was in sync with the transformed
  162. signal - this might mean simply passing the dry version as separate outputs from KR).
  163. - Create a mode in scMod which increments values based on an onset detector. So that
  164. changes only happen on attacks. This still doesn't help if the pedal is down (or
  165. if notes are sustained) but otherwise might be a better way to ramp parameters.
  166. - (DONE) The ability for the measurements to be called at the correct time must be built in.
  167. (or alternatively to use stored effects).
  168. - (DONE) Effects applied to the playback fragments.
  169. - (DONE?) CROSSFADE BUG - This may be fixed by the change to cmDspAvailCh which
  170. handles the case where no channels are available by sending an error message
  171. but not actually changing the state of the cross fader.
  172. + Experiments:
  173. 1) Speaker placement and live/electronics mix.
  174. 2) Sliver mix level
  175. 3) Try varying degrees of effects
  176. *** 11/19
  177. + The recd/play fade should be able to trigger from a capture note as well as
  178. playback note. An offset might also be useful.
  179. (Should be a default fade for each fragment - keyed to the input. This
  180. will be the fade that will occur when
  181. + Allow setting fade time in the score.
  182. + Allow setting fade rate based on 1.0 to 0.0 from fade point to end point.
  183. + Allow for multiple fade markers per fragment. (what does this mean?)
  184. + Write code to ignore playback when the score follower is not stable -
  185. or to throw out fragments where there is a mistake.
  186. + At the end of each fragment recording the fragment should be truncated by the look-ahead time
  187. to avoid capturing the attack of the marked note.
  188. + Part 2 data analysis: analyze the order of notes in counter rhythms.
  189. + Allow 'evenness' sequences to have non-even relationships.
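
The fragment truncation item above amounts to a small calculation. A minimal sketch, assuming the look-ahead time is given in seconds (as with 'LA Secs') and the fragment length in samples; fragTruncSmpCnt() is a hypothetical name.

```c
#include <assert.h>
#include <stddef.h>

/* Return the number of samples to keep from a recorded fragment after
   removing the look-ahead time from its end, so that the attack of the
   marked note is not captured. Returns 0 if the fragment is shorter
   than the look-ahead. */
size_t fragTruncSmpCnt( size_t recdSmpCnt, double laSecs, double srate )
{
  assert( laSecs >= 0.0 && srate > 0.0 );

  /* look-ahead in samples, rounded to the nearest sample */
  size_t laSmpCnt = (size_t)(laSecs * srate + 0.5);

  return laSmpCnt >= recdSmpCnt ? 0 : recdSmpCnt - laSmpCnt;
}
```

For example, a one second fragment at 48000 Hz with a 0.25 second look-ahead keeps 36000 samples.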
  190. *** 11/1
  191. + Change the wavetable to read stereo files or add a second wavetable to play the other channel.
  192. + The console window is not always updating from the bottom.
  193. + The 'Dump' button results are not going to the console window.
  194. + Put dry signal into separate output channels.
  195. + Add 'adaptive' mode parameters to scale/range mappings.
  196. + The 'meas'->'parameter' mappings should be changeable from scMod (mod0.js)
  197. + The 'adaptive' mode parameters (e.g. offset and invert) need to be connected in the patch.
  198. + Create a mode in scMod which increments values based on an onset detector. So that
  199. changes only happen on attacks. This still doesn't help if the pedal is down (or
  200. if notes are sustained) but otherwise might be a better way to ramp parameters.
  201. + Mark all notes in the score according to how well they would act as places to transition.
  202. Notes held while the pedal is down would not be good places to transition.
  203. These indicators would then be used to determine where a section change can occur
  204. when the actual section change is missed.
  205. + (done) All score_loc's and event indexes in meas0.js beginning with
  206. score location 743 must be decremented by two. (e.g. loc 743 becomes 741 ...)
  207. (score_loc_1.txt is now the correct score file)
  208. + When scanning past ramping variables in scMod the end value should be
  209. taken as the next variable(???) - this is not necessarily correct
  210. because one never knows where a timed change may end - maybe ramped
  211. variables should include a 'skip value' giving the next ambient value
  212. for the ramped variable - experiment with this to figure out what works.
  213. *** 10/17
  214. Select bar 129.
  215. Start on F5 before 129.
  216. Score follower jumps to loc. 978 then backs up to 973.
  217. *** 9/27
  218. * (DONE) Implement live recording for use in part 2.
  219. * Implement a delay between when a new section is set to trigger and when
  220. it actually does. This might allow transitions to be set up prior to when they are heard.
  221. * (DONE) OS-X version crashes when the printf("PROCSET ...) is removed from _cmScProcSets().
  222. * Add ability to set mappings and perf. measure settings to scMod.
  223. * The scMod should play through all changes up to the cur starting location so that
  224. we can mimic the state of playing the piece through but allow starting from any location.
  225. * Experiment with changing settings using the scMod ramping functions.
  226. * Make a source separation based fade using a filter/inverse filter based on the
  227. spectrum prior to the cross-fade. As an extra feature notice the state of the pedal
  228. and decay appropriately.
  229. * The electronic score needs the ability to specify that the output is sent to the audible
  230. cross-fade channel rather than the inactive channel.
  231. * Add a score mode that performs some action (e.g. incr/decr) on each incoming score follower
  232. event.
  233. * In AvailCh what happens when no available channels are found - this may be the cause
  234. of the cross-fade cut-out problem.
  235. * Add eq stage to output.
  236. * Setting the upper slope to a negative value is effective.
  237. * Demo Material: Seq 8 Mark 204 (1024-4 2048-4) Fade 10ms Cost->Threshold.
  238. - recd7 Mark 151
  239. - recd8 Mark 145 - same settings as previous take.
  240. - (recd9) Mark 145 - w/ changing xfade switch to mode 4 at 38
  241. - recd10 Mark 145 - same w/ no mode change
  242. - recd11 Mark 145 - a. 3 different takes w/ score constant - no perf. parameters used - fixed threshold=65
  243. - recd12 Mark 148 b.(same settings as recd9)
  244. - recd13 Mark 151 c.
  245. - recd14 Mark 161 Section 40 - fixed changes see mod0.js for note
  246. - Seq 6 Mark 145 M-92 (38,39,39a) (recd16) (recd17) (recd18) (recd19)
  247. - Seq 7 Mark 167 M-100
  248. *** 8/13
  249. * Cross-fade was cutting out during demo.
  250. * Missing takes between seq 7 and seq 8.
  251. * The MIDI is mis-aligned against the audio.
  252. * Are cost / tempo working? ... test changing mode.
  253. * Add an automatic volume adjustment to prevent parameter changes from causing large volume changes.
  254. * What can we actually do after a MIDI note is received? Is it already too late to
  255. send parameters w/o affecting the attack of the note?
  256. * Live Test material
  257. Part 1: Meas:76 - (one measure) to get measurements for later sections
  258. Part 1: Meas:94 (sect 38 (Seq 6)) through Meas:136 (include 136 stop at 137).
  259. Part 2: ("/Users/kevin/temp/piano score part 2 draft 1 master m 232 - 241 1st 2 bts.sib")
  260. The downbeat of part two aligns with the 3rd beat (in 4/4) of 122 in the first part.
  261. Ends on measure 131 at end of beat 3.
  262. MISSING MIDI for measures 114:126
  263. *** July 10
  264. ** (done) Make separate mappings and scale/range controls for left and right.
  265. ** Add EQ output stage.
  266. ** Work out the speaker setup.
  267. ** Add capture/playback. Analyze notes on capture and do not play if there are
  268. any wrong notes. (Skipped notes however are acceptable.)
  269. **
  270. ** When a section transition occurs late - (e.g. due to dropped notes) do not apply
  271. the transformation all at once - instead either ramp it in or step it in on subsequent
  272. attacks. Section transitions which are positively identified are intended to have
  273. dramatic changes so applying the updated parameters immediately is acceptable - but
  274. when the parameters are changed mid-section they should be applied much more subtly.
  275. ** (done) Build a database of measurements and setup the program to be able to
  276. apply a given measurement at its assigned section.
  277. ** Redevelop spectral distortion algorithm to use a spline as the transform.
  278. ** use log-frequency transform instead of FFT.
  279. ** Allow for a continuous window size via zero padding.
  280. ** Add a 'write' preset file button - so the preset file can be saved prior to next crash.
  281. ** measurement values can generate MAX_DBL - be sure they are not being sent through to the audio algorithm.
  282. (see ln:965 cmDspKr.c for a hack to fix this)
  283. ** add invert to scale/range to cause output to go in opposite direction.
  284. ** non-grace evenness are used to generate a measurement value from previous tempo calibration section.
  285. (non-grace evenness notes therefore have two scores: 'evenness' and 'overall-duration').
  286. ** note that the default setting for dyn and even.
  287. ** add dyn,even,tempo,cost number boxes to allow artificial setting of these parameters.
  288. add dyn,even,tempo,cost as modulator variables to allow them to be set from the modulator
  289. With these additions we can simulate applying measurements at the 'application' section.
  290. ** add beginning and ending measure numbers to 'seq' labels
  291. ** Gain compensation for mode 4.
  292. ** Do IFFT using cos()^c + i * sin()^c - these bases are orthogonal but cause harmonic distortion.
  293. To be efficient this might involve writing an FFT function.
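
For the MAX_DBL item above, one possible guard is sketched below. measSanitize() is a hypothetical helper, not the hack at ln:965 of cmDspKr.c, and it assumes that the bad values show up as DBL_MAX, -DBL_MAX or a non-finite number. A rejected value leaves the output untouched so the previous parameter stays in effect.

```c
#include <assert.h>
#include <float.h>
#include <math.h>
#include <stdbool.h>
#include <stddef.h>

/* Returns true and writes a value clamped to [minV,maxV] into *out if
   v is usable. Returns false, leaving *out unchanged, if v is
   non-finite or equal to the +/-DBL_MAX sentinel. */
bool measSanitize( double v, double minV, double maxV, double* out )
{
  assert( out != NULL && minV <= maxV );

  if( !isfinite(v) || v == DBL_MAX || v == -DBL_MAX )
    return false;

  *out = v < minV ? minV : (v > maxV ? maxV : v);
  return true;
}
```

The range limits would come from whatever the receiving audio parameter can tolerate.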
  294. // May 22
  295. ** Crashers (Should be tested but are probably already fixed following score follower debugging.)
  296. Seq 2 m24 Mark 36 & 38
  297. Seq 4: Mark 115
  298. Seq 4: m76-79 Mark 129 (W/ meas: even & dyn -> thresh change min Thresh to 40) - use 4th seq w/ b1
  299. Seq 6: m92 Mark 143
  300. Seq 7: m103 Mark 173
  301. Seq 7: Mark 172 Meas 103 - always crashes on playback.
  302. Seq 4: Mark 76 Meas 40 - crash!
  303. Seq 2: First mark meas 23
  304. Crash seems to happen in cmProc4.c: _cmScMatchInitMtx() ln:1311.
  305. It looks like a memory overrun. Looks like the first line is wrong
  306. shouldn't:
  307. if( rn >p->mrn && cn > p->mcn )
  308. be
  309. if(rn*cn > p->mrn*p->mcn)
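
The difference between the two conditions can be checked with a small stand-alone sketch. mtxCap_t and the two functions are hypothetical stand-ins and are not copied from cmProc4.c. The sketch assumes the matrix is stored in one flat buffer sized for mrn*mcn elements, so that only the total element count matters.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for the matcher state: mrn/mcn are the row and
   column counts the matrix buffer was last allocated for. */
typedef struct
{
  unsigned mrn;
  unsigned mcn;
} mtxCap_t;

/* Condition as quoted in the note: grows only when BOTH dimensions
   exceed the allocated dimensions. */
bool needsReallocAnd( const mtxCap_t* p, unsigned rn, unsigned cn )
{
  assert( p != NULL );
  return rn > p->mrn && cn > p->mcn;
}

/* Proposed condition: compare total element counts.
   The products are widened to avoid unsigned overflow. */
bool needsReallocProd( const mtxCap_t* p, unsigned rn, unsigned cn )
{
  assert( p != NULL );
  return (unsigned long long)rn * cn > (unsigned long long)p->mrn * p->mcn;
}
```

With a 4x4 allocation and a 10x2 request the first condition reports that no reallocation is needed even though 20 elements do not fit in 16, which is consistent with a memory overrun. If the real buffer is indexed using the allocated dimensions rather than the current ones, a per-dimension test using || might be needed instead.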
  310. BUGS:
  311. // Apr 20
  312. The tempo measurement can produce invalid values. Set score to 22 then play Mark 38.
  313. First tempo measurement is a nonsense value - probably produced by a div by zero.
  314. Also: Mark 8, Meas 10.
  315. Crash on playing Mark 37.
  316. Click on list control outside below list item - crash!
  317. Select Mark 171 (Seq 7) Section 43, m103 crash
  318. // Feb 27
  319. + Audio seems to precede MIDI by around 250ms; this probably arises
  320. from a delay that was inserted by 'mas'. Can the delay be removed?
  321. // Feb 25
  322. + Fix the audio file input/output ports
  323. //
  324. // Feb 6 & 7
  325. //
  326. + Performed notes which arrive within about 50ms of each other could be
  327. considered chords. Extra notes which were not part of
  328. the chord are probably common and should be discounted during
  329. the cost analysis.
  330. + (done) Add alignment cost as a 4th variable along with dyn,even,tempo.
  331. + (done) In the score print out (score_loc.txt) Section 2 is starting on Bar 5
  332. when it should start inside Bar 7.
  333. + In Take 1 the 2nd dynamics set is not triggered.
  334. + In Take 3 Evenness 2 the C2 and F#4 are NOT missing although they are in the
  335. evaluation. Also E7 which ends that set is not marked with an 'e'.
  336. + It is possible to have even-non-grace sets where the note rhythm values are not
  337. all the same (e.g. bar 20 )
  338. + Use soft-thresholds for the dynamics categories.
  339. + Set 39 even measure 25 shows the first note as G#2 when it should be C#1
  340. + Missing MIDI note sounded:
  341. C1 score-loc:132
  342. A#2 140
  343. A5 173
  344. C#2 195 Marker 36
  345. E5,G33,A#2,C#1 212-218 Marker 36
  346. E3 185 Marker 37
  347. F5,C#4,G#5 986 Marker 204
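
The 50ms chord idea noted above could be prototyped along these lines. This is a sketch only: groupChords() is a hypothetical function, and it assumes onset times in milliseconds, sorted in ascending order, with the window measured from the first note of each chord.

```c
#include <assert.h>
#include <stddef.h>

/* Assign a chord index to each onset. onsetMs[] must be sorted in
   ascending order. A note joins the current chord if it arrives within
   windowMs of that chord's first note, otherwise it starts a new chord.
   Returns the number of chords found. */
size_t groupChords( const double* onsetMs, size_t n, double windowMs, size_t* chordIdx )
{
  assert( (onsetMs != NULL && chordIdx != NULL) || n == 0 );

  size_t chordCnt = 0;
  double chordT0  = 0.0;   /* onset time of the current chord's first note */

  for(size_t i=0; i<n; ++i)
  {
    if( i == 0 || onsetMs[i] - chordT0 > windowMs )
    {
      chordT0 = onsetMs[i];   /* start a new chord */
      ++chordCnt;
    }
    chordIdx[i] = chordCnt - 1;
  }

  return chordCnt;
}
```

Onsets at 0, 20, 45, 60 and 300 ms with a 50 ms window form three groups: {0,20,45}, {60} and {300}. Discounting extra notes inside a group during the cost analysis would be a separate step.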
  348. -------------------------------------------------------------------------------
  349. OSX - Install Notes
  350. -------------------------------------------------------------------------------
  351. 1. Install macports
  352. 2. Install git-core (sudo port install git-core)
  353. 3. sudo port install fftw-3
  354. 4. sudo port install fftw-3-single
  355. 5. port select --list gcc (which gcc is active)
  356. 6. sudo port install gcc47
  357. 7. sudo port select --set gcc mp-gcc47
  358. 8. sudo port install fltk
  359. 9. sudo port install xorg-libX11
  360. 10. sudo port install git-core
  361. 11. sudo port install emacs +x11
  362. 12. Install ~/Library/Preferences/org.larke.kc.txt (is this required?)
  363. 13. Create ~/Library/Preferences/kc
  364. 14. Install ~/Library/Preferences/time_line.js, time_line_preset.js, time_line_preset.csv