More

dylanbfox · on March 10, 2022

Looks interesting! Do you guys offer any sort of visibility tools/reports into customer usage of different endpoints, tracking of actual API requests (including payloads/etc) per request, etc? Is it possible to use you guys just for billing if we already have our own auth?

asommer12 · on March 10, 2022

Hey Dylan! A bit more context on each of your questions:

With regard to endpoint visibility, yes, totally. One of our strengths is the ability to even price different endpoints differently. If you have endpoints that are high-value that you want to track differently than other endpoints, you can simply tag it, and Kable will monitor that endpoint as its own dimension. So yes, you can see which customers are calling which endpoints through our dashboard.

With regard to tracking request payloads, this is something we do not currently support. We've given this some thought, and while we might change our position in the future based on customer feedback, we are currently not storing customer request data. The reason for this is privacy. We want to be careful about what information we store from our customers' customers, so we require developers to explicitly define what information gets recorded in Kable.

Finally, with respect to billing and analytics and authentication as separate features or products: *ABSOLUTELY*. You can use Kable for billing and analytics without using Kable authentication. We want it to be super easy to use both, but we understand many customers have other systems for managing API keys or other parts of the stack. In this case, we recommend using Kable's `record` method, and using us for our core competencies of billing and analytics.

dylanbfox · on March 7, 2022

Agree. The places I go on the web have become more and more centralized/limited. I think projects like this that help to surface and aggregate interesting content from the web (which is really what I come to HN to find) are great.

dylanbfox · on Dec 6, 2021

Nice work! The UI is really simple - and love not having to log in to use it. Have you thought about leveraging the ListenNotes API (https://www.listennotes.com/api/) to automatically pull in the podcast episodes via search vs having to upload them?

rememberlenny · on Dec 6, 2021

This is incredible. Thank you!!

dylanbfox · on Nov 25, 2021

I agree with all your points - but one thing I think about is: how do we fix what we have today? How do you fix the concrete jungles that most cities are today in the US. Or is it inevitable that more concrete will just be poured over time until some major natural disaster allows for a reset?

DoreenMichele · on Nov 25, 2021

The Netherlands has reversed course from a car-centric culture to a more pedestrian- and bike-friendly culture. They did that very intentionally. As far as I can tell, there is no magic about it being The Netherlands -- except possibly the influence on the culture of the polder.

https://en.m.wikipedia.org/wiki/Polder

From what I gather, protecting the polder so everyone survives is so important that it has, at times, required people at war to cooperate in keeping the water out.

Today, globally, everyone knows we need solutions here or we are all doomed. We can stop quibbling about what various factions want for personal gain and start seeking answers that help some group live lighter on the land so we all benefit.

Rinse and repeat.

Start with low-hanging fruit. As that gets done, other things will become more reachable.

_abox · on Nov 25, 2021

I think the Netherlands is still one of the more car-centric cultures in Europe though. And I've lived in several countries. Perhaps the inner cities aren't, but outside of them it's very hard to do without a car.

In many small villages/towns there is only a bus service during peak hours now. When I visit it's really a royal pain (and taxies are unaffordable as an alternative). And when you work in the Netherlands you're usually required to work nationwide which means countless hours in the car visiting clients. Public transport takes several times as long as car travel.

I really hated it when I worked there (I'm from there as you might have guessed). All these hours driving in frantic traffic were so stressful. I work in Barcelona now where public transport is much better (rural is still worse than inner-city but both are much better than the Netherlands' services). It's the first place I've lived where I genuinely don't need a car, it would only be a burden to me. Time between metros is counted in seconds and the network is so big. As well as that there's buses and trams and regional trains passing through that can be used to hop from one side of the city to the other.

The only thing that's genuinely better in the Netherlands is the bike lane network IMO :) That really is amazing. But I just don't see the feasibility of doing without a car completely there.

TacticalCoder · on Nov 25, 2021

> I work in Barcelona now where public transport is much better...

Barcelona is one of the most stressful, if not the most stressful city, to use public transport in in Europe. If anything it should serve as an example for other cities as to what to not do. Public transports are full of thieves and scammers. It is simply beyond belief. It is known that if you ever dare to retaliate when you catch people stealing you, it can quickly degenerate very badly... For you.

I don't want to hear the typical: "If you look like a local and know what not to do, you'll be fine". I want public transports to be very safe otherwise I won't use them.

Several people mentioned Tokyo already: I spent close to a year there. Now that is a city with working and safe public transports.

_abox · on Nov 26, 2021

Yes Barcelona has a big problem with pickpockets indeed. This is more of a legal issue: any theft of 400 euro or less is punishable by a fine only even if it's the 4000th time.

Because of this there's gangs of professional pickpockets. But they're trained to avoid conflict. Because any violence will incur serious charges. I've grabbed one once and pushed him against a wall when I felt him reaching for my phone and he literally was passive and relaxed. Just dropped the phone and strolled off. No way someone behaves like that unless they have trained it.

So yes they're an absolute plague but the risk of violence is low. And the cause does not have much to do with public transport (it happens on the street too) but with ignorant lawmakers :)

krageon · on Nov 25, 2021

> Public transports are full of thieves and scammers

This is true in the overwhelming majority of public transport around the world.

> Several people mentioned Tokyo

Yes, Japan is one of the safest nations in this regard. That's cool, but definitely not achievable everywhere because it is due to a lot of environmental factors (like the culture and the incredible conviction rates).

DoreenMichele · on Nov 25, 2021

I think your argument only strengthens mine. It sounds like you are saying "The Netherlands was kind of the America of Europe in terms of being crazy car-centric and they sucked so much worse than they do currently."

To which I say "So you're telling me if they can become the global poster child for doing this better, anyone can do this better -- even the US."

_abox · on Nov 26, 2021

Well, I don't know how much they suck now as I've left 10 years ago, but I know pre-corona there were many cases of the largest traffic jams ever reported in the news, almost weekly :)

When I was there these numbers were much lower despite the A2 being a horrible ever-changing construction site. And it was already so frustrating I once broke my teeth just from grinding it :(

At the same time even the frequent train routes were apparently really overcrowded. Everything is different now with Corona of course but that's worldwide.

dylan604 · on Nov 25, 2021

It also helps that the country is old enough to have developed before the automobile. Kinda hard to put the toothpaste back in the tube when you've designed entire cities around support the automobile.

occz · on Nov 25, 2021

>It also helps that the country is old enough to have developed before the automobile. Kinda hard to put the toothpaste back in the tube when you've designed entire cities around support the automobile.

I think this is a misconception for sure - the netherlands used to have much more car-oriented development but saw the issues with it and started retrofitting their built environments.

The same goes for the U.S in reverse, actually, with a lot of cities not having been designed with the car in mind and then subsequently having been demolished and retrofitted for the car. The transition is totally possible to do in both directions.

unmole · on Nov 25, 2021

> As far as I can tell, there is no magic about it being The Netherlands

While not quite magic, a famously flat terrain and a mild climate surely had an impact?

Cthulhu_ · on Nov 25, 2021

> how do we fix what we have today? How do you fix the concrete jungles that most cities are today in the US.

Elbow grease. Break the concrete and plant shit, pay people to maintain these things every once in a while. I mean most amenity planting is low maintenance and only needs looking after once or twice a year to avoid it growing wild.

The US has the solutions already - a lot of money, and a lot of people looking for a steady job. All it takes is for people to stop hoarding said money and Decide to solve the issue.

KennyBlanken · on Nov 25, 2021

Somerville, Massachusetts had a large Italian population a generation or two back and it seems damn near all of them considered "making it" covering every square inch of their land with concrete. Then putting up a 2 foot high brick wall topped with a few-feet-tall wrought iron fence.

The city has been working to undo it by providing financial incentives for removal, and partnering with an informal group of volunteers to make it cheap to do. They put out a call for public volunteers to have a sort of reverse-barn-raising. A big bunch of people show up and help rip apart and load up the concrete.

Jamaica Plain, a neighborhood in Boston, used to have a group of volunteers that worked with local nurseries to plant a tree and take care of the initial critical care (watering regularly) for free. All you had to do was email them, and then point when the crew showed up.

micromacrofoot · on Nov 25, 2021

local politics is the only way

presentation · on Nov 25, 2021

Agreed but given American politics this days, a well-placed earthquake will probably happen first.

dylanbfox · on Oct 8, 2021

Hi there - OP here - thanks for reading!

This blog is more of an intro to a few high level concepts (multi-GPU and multi-node training, fp32 vs fp16, buying hardware and dedicated machines vs AWS/GCP, etc) for startups that are early into their deep learning journey, and that might need a nudge in the right direction.

If you're looking for a deep dive into the best GPUs to buy (cost/perf, etc), the link in the below comment gives a pretty good overview.

PS - I can send you some benchmarks we did that show (at least for us) Horovod is ~10% faster than DDP for multi-node training FWIW. Email is in my profile!

dylanbfox · on Oct 7, 2021

Author here. Thanks for your comments!

In general - this is expensive stuff. Training big, accurate models just requires a lot of compute, and there is a "barrier to entry" wrt costs, even if you're able to get those costs down. I think it's similar to startups not really being able to get into the aerospace industry unless they raise lots of funding (ie, Boom Supersonic).

Practically speaking though, for startups without funding, or access to cloud credits, my advice would be to just train the best model you can, with the compute resources you have available. Try to close your first customer with an "MVP" model. Even if your model is not good enough for most customers - you can close one, get some incremental revenue, and keep iterating.

When we first started (2017), I trained models that were ~1/10 the size of our current models on a few K80s in AWS. These models were much worse compared to our models today, but they helped us make incremental progress to get to where we are now.

dylanbfox · on Oct 7, 2021

Dylan from Assembly here. If you want to send me one of your audio files (my email is in my profile) I'd be happy to send you back the diarized results from our API.

You can also signup for a free account and test from the dashboard without having to write any code if that's easier.

Other than lots of crosstalk in your group conversations - is there anything else challenging about your audio (eg, distance from microphones, background noise, etc?)

dylanbfox · on Oct 7, 2021

Great question. This is technically referred to as "Wake Word Detection". You run a really small model locally that is just processing 500ms (for example) of audio at a time through a light weight CNN or RNN. The idea here is that it's just binary classification (vs actual speech recognition).

There are some open source libraries that make this relatively easy:

- https://github.com/Kitt-AI/snowboy (looks to be shutdown now) - https://github.com/cmusphinx/pocketsphinx

This avoids having to stream audio 24x7 to a cloud model which would be super expensive. This being said, I'm pretty sure what the Alexa does, for example, is send any positive wake word to a cloud model (that is bigger and more accurate) to verify the prediction of the local wake word detection model AFAIK.

Once you are positive you have a positive wake word detected - that's when you start streaming to an accurate cloud based transcription model like Assembly to minimize costs!

dylanbfox · on Oct 7, 2021

Interesting. How do you guys manage spot interruptions when training on spot instances?

etrain · on Oct 7, 2021

Users expose their model to our Trial API (https://docs.determined.ai/latest/topic-guides/model-definit...), the base class then implements a training loop (which can be enhanced with user-supplied callbacks, metrics, etc.) that has a whole bunch of bells and whistles. Easy distributed (multi-GPU and multi-node) training, automatic checkpointing, fault tolerance, etc.

Concretely, the system is regularly taking checkpoints (which include model weights and optimizer state) and so if the spots disappear (as they do), the system has enough information to resume from where things were last checkpointed when resources become available again.

dylanbfox · on Oct 7, 2021

This is tricky. The de facto metric to evaluate an ASR model is Word Error Rate (WER). But results can vary widely depending on the pre-processing that's done (or not done) to transcription text before calculating a WER.

For example if you take the WER of "I live in New York" and "i live in new york" the WER would be 60% because you're comparing a capitalized version vs an uncapitalized version.

This is why public WER results vary so widely.

We publish our own WER results and normalize the human and automatic transcription text as much as possible to get as close to "true" numbers as possible. But in reality, we see a lot of people comparing ASR services simply by doing diffs of transcripts.