Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Jay uses eclipse.org
  • anyone uses source forge -no
  • Todd - has a contributor license agreement, which is basically apaches - this is the same license agreement, this helps us -
    • we are virally benefiting here
  • most of development is done in apache, but we do a lot of customized code for special cases, those codes - manage outside of the project basis, in source forge or our own servers -
    • dual repository - dual development - the core - try a new capability, can't just dump it into apache, have to go through a process to get this into the core functionality -
    •  
    • If so, how do you manage distribution of project contributions that are not accepted by the core open source package distribution?
  • What relationships should we encourage with other possible distribution channels (including adoption by industry?)
    • public repository repositories (of also different type) - Frank Loffler
    • website downloads for Condor, mirrored - couple in the US and couple in Europe
      • and downstream distros, in package repositories for debian, etc
      • and package repositories -
      • ours get updated monthly, while downstream only take stable
      • since it is apache, can use in commerical products - there are commercial products of condor that aren't called condor -
      • some are closely work with us, RedHat Merge, G is "condor" - redhat has engineers that help work on condor, they actually have them live in madison, work side by side with UW developers.  do they contribute back to the condor source tree?  they make regular contributions back, and upstream - get contributions from users - some commercial users that don't contribute anything back
      • benefits from commercial side - George gets tests on weird machines, and gets bug reports back - this is a benefit -
        • Amber - has had students hired since they are experts in the code, eg, hired by Genentech - someone who understood amber thoroughly -
        • source - is "source available", encourage folks to get source and have them compile - this is a pain -to distribute binary -
        • windows - does give binaries, as windows is so nicely defined for this
        • changes in glibc in linux, is a huge problem
        • but this sort of problem in windows doesn't exist -
        • there are people running on windows clusters, but not exotic hw + windows -
        • bluegene - is a pain - but not many people with those machines, and all of the supercomputers can compile on, have people on the staff too that can do this
        • used to be fine to give single cpu versions of the code out as binaries
        • but a large amount of functionality in code only works in parallel - have to build in MPI layer -
        • downtream industry people have QA people, do more QA
          • get to run in environments that coudln't run in before -
          • when run in different environments, or in different ways, learn something -
        • Flash - weird relationship with industry, most of them want the code as works with any infrastructure, want code for their test suites - this has our pluses for us - as on several of the HPC platforms, get to be on the platform early, works out kinks of porting the code is extremely helpful to us
          • and several vendors licensing representative workload - low level adjustments need to make in the hw -
          • they modify their architecture
        • Amber - have industry use the code, and the other side - where sell machines, where code is a benchmark - nvidia is a classic example -
          • this is used in chip design, this is in the regression test for new nvidia gpus -
          • this is a nice interaction with industry -this is possible as there is a market for
        • benchmarks for systems
          • amber
          • flash
          • gamess
    • data licenses - what does this mean -
      • anyone see this?
      • flash - from certain of simulatinos, have made data public, along with the tools for data anlysis, but this is freely available -
      • two types of data - one can make from simulations, the other used as input, where the data may be more expensive to obtain -
        • certainly, data from simulation, at some point, gets distilled down to a paper, and is archived or deleted -
        • data licenses seem to have more applicability for input?
        • Anshu disagrees, as not everyone has access to machines -turbulence, is one community that keeps increasing resolution of the the simulations
        • in many fields, lack of data - data mining, esp with thing that has to do with proprietary data, improved method - some journals require you to make data available
          • not sure if they have requirement to make sw available - something else you are thinking?
        • Yan - not dealing with data licenses yet, but know it is coming
        • and another issue - host the data, host services, and send users - have to store the data a long time for users to get it back or permanently -
        • Ross - end user license for Anton machine at PSC, have to leave data on the machine -
        •  
      •  
  • What about dual licensing models that differentiate between commercial and non-commercial use?
    • Can this model be used to enable software sustainability?
  • What is your experience/relation with your institution's technology marketing office?

...