[2024 EXTENDED] Black Friday / Cyber Monday: FLASH SALE & MEGATHREAD

dev_vps · December 2024

@emgh said:

@dev_vps said:

@emgh said:

o1 pro very worth

My use case
https://community.openai.com/t/any-other-pro-users-using-o1-for-math/1055094

why make the llm do math?

make llm do python make python do math

Because I teach jee calculus

emgh · December 2024

@Savvy said:
Place 10 for comments now

good job

tansel · December 2024

@donli said:

@tansel said:

@kuroit said:

@raza19 said:
Dear @kuroit we r sincerely missing ur presence. I realize u r preparing for Christmas/boxing Friday deals but wt a pleasant surprise it wud be if u offered 7 Dollah Singapore deals at the auspasciois occasion of having reached 1000 pages. That's just good business and karma

You have been heard!

1 vCore
1GB DDR4 RAM
15GB SSD/NVMe Disk
1TB Bandwidth @ 1/10Gbps Uplink

Supported Locations:
West Midlands, UK
Dallas, USA
Tampa, USA
Los Angeles, USA
Ashburn, USA
The Netherlands
Singapore

Price: 7.77GBP/Year with promocode
Promocode: YOU-KNOW-WHO!
Stock: 5

Order: https://my.kuroit.com/store/sale-offers/bf-worldwide-1vcore-1gb-ram-15gb-disk

Is there still stock available? I missed it, and the promo code shows as expired.

Do you even know where you are!?

Am I causing you any trouble by asking if there’s still stock available?

JohnFilch123 · December 2024

@tansel said:

Am I causing you any trouble by asking if there’s still stock available?

Nope but the answer is kinda obvious.

Savvy · December 2024

@tansel said:

@donli said:

@tansel said:

@kuroit said:

@raza19 said:
Dear @kuroit we r sincerely missing ur presence. I realize u r preparing for Christmas/boxing Friday deals but wt a pleasant surprise it wud be if u offered 7 Dollah Singapore deals at the auspasciois occasion of having reached 1000 pages. That's just good business and karma

You have been heard!

1 vCore
1GB DDR4 RAM
15GB SSD/NVMe Disk
1TB Bandwidth @ 1/10Gbps Uplink

Supported Locations:
West Midlands, UK
Dallas, USA
Tampa, USA
Los Angeles, USA
Ashburn, USA
The Netherlands
Singapore

Price: 7.77GBP/Year with promocode
Promocode: YOU-KNOW-WHO!
Stock: 5

Order: https://my.kuroit.com/store/sale-offers/bf-worldwide-1vcore-1gb-ram-15gb-disk

Is there still stock available? I missed it, and the promo code shows as expired.

Do you even know where you are!?

Am I causing you any trouble by asking if there’s still stock available?

You forgot to close your comment with "reguards" and that is considered rude here

steny · December 2024

@plumberg said:

@steny said:
@plumberg said:

any recommendations for selfhosted llm which would help with code generation? Claude does amazingly well, but I end up making it work so much that I am rate-limited.

You need quite a lot of

@plumberg said:
any recommendations for selfhosted llm which would help with code generation? Claude does amazingly well, but I end up making it work so much that I am rate-limited.

For a self hosted LLM you need a lot of VRAM especially for coding. For home PC probably the best model you can run is currently Qwen2.5-32B-Coder . For coding, unlike chat you need at least half(8 bits) precision, so that means you will need around 40GB Vram, e.g. Dual rtx 3090, which is a setup I am using. Bigger models, like Qwen2.5-72B or LLama-Nemotron-70B are better, but you won't run it at home at that precision unless you build 4xGPU rig. For the largest open weight models like Mistral Large, you need dual H100 to run it, so you definitely need to rent GPU and won't be cheap but you get around a performance of GPT-4o in coding there, probably slightly less since you will still run it in half precision only.

First off, thanks for the detailed post.

So here is the deal. I have 0 GPU. But have a pair of dual E5-2699v4 pair with decent RAM (384 gb ddr4 2400 speeed or something).

I am not really interested in getting fastest responses. As long as it spits out decent I am game.

Or am I dreaming of hosting a llm ? What are your thoughts?

Running on Ram would be awfully slow, especially those large models you could theoretically run with that amount of ram.

emgh · December 2024

regar

emgh · December 2024

ds

emgh · December 2024

how many minutes of music did you guys listen to 2024

plumberg · December 2024

@steny said:

@plumberg said:

@steny said:
@plumberg said:

any recommendations for selfhosted llm which would help with code generation? Claude does amazingly well, but I end up making it work so much that I am rate-limited.

You need quite a lot of

@plumberg said:
any recommendations for selfhosted llm which would help with code generation? Claude does amazingly well, but I end up making it work so much that I am rate-limited.

For a self hosted LLM you need a lot of VRAM especially for coding. For home PC probably the best model you can run is currently Qwen2.5-32B-Coder . For coding, unlike chat you need at least half(8 bits) precision, so that means you will need around 40GB Vram, e.g. Dual rtx 3090, which is a setup I am using. Bigger models, like Qwen2.5-72B or LLama-Nemotron-70B are better, but you won't run it at home at that precision unless you build 4xGPU rig. For the largest open weight models like Mistral Large, you need dual H100 to run it, so you definitely need to rent GPU and won't be cheap but you get around a performance of GPT-4o in coding there, probably slightly less since you will still run it in half precision only.

First off, thanks for the detailed post.

So here is the deal. I have 0 GPU. But have a pair of dual E5-2699v4 pair with decent RAM (384 gb ddr4 2400 speeed or something).

I am not really interested in getting fastest responses. As long as it spits out decent I am game.

Or am I dreaming of hosting a llm ? What are your thoughts?

Running on Ram would be awfully slow, especially those large models you could theoretically run with that amount of ram.

How slow are we talking about? Any idea?

And will that change the quality of the output?

Reguards

_MS_ · December 2024

This place is dry like the desert, anybody lubin' the deals or what?

emgh · December 2024

MS said:
This place is dry like the desert, anybody lubin' the deals or what?

East Bound and Down!!!

_MS_ · December 2024

@emgh said:
how many minutes of music did you guys listen to 2024

Miss the guy.

tansel · December 2024

@tansel said:

@donli said:

@tansel said:

@kuroit said:

@raza19 said:
Dear @kuroit we r sincerely missing ur presence. I realize u r preparing for Christmas/boxing Friday deals but wt a pleasant surprise it wud be if u offered 7 Dollah Singapore deals at the auspasciois occasion of having reached 1000 pages. That's just good business and karma

You have been heard!

1 vCore
1GB DDR4 RAM
15GB SSD/NVMe Disk
1TB Bandwidth @ 1/10Gbps Uplink

Supported Locations:
West Midlands, UK
Dallas, USA
Tampa, USA
Los Angeles, USA
Ashburn, USA
The Netherlands
Singapore

Price: 7.77GBP/Year with promocode
Promocode: YOU-KNOW-WHO!
Stock: 5

Order: https://my.kuroit.com/store/sale-offers/bf-worldwide-1vcore-1gb-ram-15gb-disk

Is there still stock available? I missed it, and the promo code shows as expired.

Do you even know where you are!?

Am I causing you any trouble by asking if there’s still stock available?

Am I causing you any trouble by asking if there’s still stock available? Regards.

emgh · December 2024

@MS said: Miss the guy.

Yes ;(

I did about 52k minutes 24!

emgh · December 2024

@tansel said:

@tansel said:

@donli said:

@tansel said:

@kuroit said:

@raza19 said:
Dear @kuroit we r sincerely missing ur presence. I realize u r preparing for Christmas/boxing Friday deals but wt a pleasant surprise it wud be if u offered 7 Dollah Singapore deals at the auspasciois occasion of having reached 1000 pages. That's just good business and karma

You have been heard!

1 vCore
1GB DDR4 RAM
15GB SSD/NVMe Disk
1TB Bandwidth @ 1/10Gbps Uplink

Supported Locations:
West Midlands, UK
Dallas, USA
Tampa, USA
Los Angeles, USA
Ashburn, USA
The Netherlands
Singapore

Price: 7.77GBP/Year with promocode
Promocode: YOU-KNOW-WHO!
Stock: 5

Order: https://my.kuroit.com/store/sale-offers/bf-worldwide-1vcore-1gb-ram-15gb-disk

Is there still stock available? I missed it, and the promo code shows as expired.

Do you even know where you are!?

Am I causing you any trouble by asking if there’s still stock available?

Am I causing you any trouble by asking if there’s still stock available? Regards.

You asking yourself?

tansel · December 2024

@Savvy said:

@tansel said:

@donli said:

@tansel said:

@kuroit said:

@raza19 said:
Dear @kuroit we r sincerely missing ur presence. I realize u r preparing for Christmas/boxing Friday deals but wt a pleasant surprise it wud be if u offered 7 Dollah Singapore deals at the auspasciois occasion of having reached 1000 pages. That's just good business and karma

You have been heard!

1 vCore
1GB DDR4 RAM
15GB SSD/NVMe Disk
1TB Bandwidth @ 1/10Gbps Uplink

Supported Locations:
West Midlands, UK
Dallas, USA
Tampa, USA
Los Angeles, USA
Ashburn, USA
The Netherlands
Singapore

Price: 7.77GBP/Year with promocode
Promocode: YOU-KNOW-WHO!
Stock: 5

Order: https://my.kuroit.com/store/sale-offers/bf-worldwide-1vcore-1gb-ram-15gb-disk

Is there still stock available? I missed it, and the promo code shows as expired.

Do you even know where you are!?

Am I causing you any trouble by asking if there’s still stock available?

You forgot to close your comment with "reguards" and that is considered rude here

Thank you for your guidance, I have learned it. Regards.

_MS_ · December 2024

@emgh said: Yes ;(

His documentary was always painful to watch.
Avicii: True Stories (2017)

@emgh said: I did about 52k minutes 24!

Don't know. Still use offline media like a chad data hoarder.

tansel · December 2024

@JohnFilch123 said:

@tansel said:

Am I causing you any trouble by asking if there’s still stock available?

Nope but the answer is kinda obvious.

I don’t think there’s anything wrong with inquiring about the stock. Did I violate any forum rules? If so, I apologize to you. Regards

emgh · December 2024

@tansel said:

@JohnFilch123 said:

@tansel said:

Am I causing you any trouble by asking if there’s still stock available?

Nope but the answer is kinda obvious.

I don’t think there’s anything wrong with inquiring about the stock. Did I violate any forum rules? If so, I apologize to you. Regards

He meant because it's such a good price, and only 5 in stock, it'll obviously be sold out within minute(s)

This discussion isn't worth continuing regards

steny · December 2024

@plumberg said:

@steny said:

@plumberg said:

@steny said:
@plumberg said:

any recommendations for selfhosted llm which would help with code generation? Claude does amazingly well, but I end up making it work so much that I am rate-limited.

You need quite a lot of

@plumberg said:
any recommendations for selfhosted llm which would help with code generation? Claude does amazingly well, but I end up making it work so much that I am rate-limited.

For a self hosted LLM you need a lot of VRAM especially for coding. For home PC probably the best model you can run is currently Qwen2.5-32B-Coder . For coding, unlike chat you need at least half(8 bits) precision, so that means you will need around 40GB Vram, e.g. Dual rtx 3090, which is a setup I am using. Bigger models, like Qwen2.5-72B or LLama-Nemotron-70B are better, but you won't run it at home at that precision unless you build 4xGPU rig. For the largest open weight models like Mistral Large, you need dual H100 to run it, so you definitely need to rent GPU and won't be cheap but you get around a performance of GPT-4o in coding there, probably slightly less since you will still run it in half precision only.

First off, thanks for the detailed post.

So here is the deal. I have 0 GPU. But have a pair of dual E5-2699v4 pair with decent RAM (384 gb ddr4 2400 speeed or something).

I am not really interested in getting fastest responses. As long as it spits out decent I am game.

Or am I dreaming of hosting a llm ? What are your thoughts?

Running on Ram would be awfully slow, especially those large models you could theoretically run with that amount of ram.

How slow are we talking about? Any idea?

And will that change the quality of the output?

Reguards

The inference speed is mainly dependend on memory bandwidth, Dual rtx 3090 runs 70B model around 15-20 tokens/second. There is some speed loss due to dual setup, yet DDR 2400 bandwidth is about 50xtimes less of 3090, So expect bellow 1 Token per second, where token is like 3-4 characters. And that is just a middle sized models, the large ones would be in fractions of token per second. The quality would be the same though.

stackr · December 2024

@emgh said:
This discussion isn't worth continuing regards

Can continue just for the regards

plumberg · December 2024

@steny said:

@plumberg said:

@steny said:

@plumberg said:

@steny said:
@plumberg said:

any recommendations for selfhosted llm which would help with code generation? Claude does amazingly well, but I end up making it work so much that I am rate-limited.

You need quite a lot of

@plumberg said:
any recommendations for selfhosted llm which would help with code generation? Claude does amazingly well, but I end up making it work so much that I am rate-limited.

For a self hosted LLM you need a lot of VRAM especially for coding. For home PC probably the best model you can run is currently Qwen2.5-32B-Coder . For coding, unlike chat you need at least half(8 bits) precision, so that means you will need around 40GB Vram, e.g. Dual rtx 3090, which is a setup I am using. Bigger models, like Qwen2.5-72B or LLama-Nemotron-70B are better, but you won't run it at home at that precision unless you build 4xGPU rig. For the largest open weight models like Mistral Large, you need dual H100 to run it, so you definitely need to rent GPU and won't be cheap but you get around a performance of GPT-4o in coding there, probably slightly less since you will still run it in half precision only.

First off, thanks for the detailed post.

So here is the deal. I have 0 GPU. But have a pair of dual E5-2699v4 pair with decent RAM (384 gb ddr4 2400 speeed or something).

I am not really interested in getting fastest responses. As long as it spits out decent I am game.

Or am I dreaming of hosting a llm ? What are your thoughts?

Running on Ram would be awfully slow, especially those large models you could theoretically run with that amount of ram.

How slow are we talking about? Any idea?

And will that change the quality of the output?

Reguards

The inference speed is mainly dependend on memory bandwidth, Dual rtx 3090 runs 70B model around 15-20 tokens/second. There is some speed loss due to dual setup, yet DDR 2400 bandwidth is about 50xtimes less, So expect bellow 1 Token per second, where token is like 3-4 characters. And that is just a middle sized models, the large ones would be in fractions of tokens per second.

Gotcha. Well I wanna try it out though and see where it takes me. Thanks.

emgh · December 2024

@stackr said:

@emgh said:
This discussion isn't worth continuing regards

Can continue just for the regards

regards

donli · December 2024

@emgh said:

@tansel said:

@JohnFilch123 said:

@tansel said:

Am I causing you any trouble by asking if there’s still stock available?

Nope but the answer is kinda obvious.

I don’t think there’s anything wrong with inquiring about the stock. Did I violate any forum rules? If so, I apologize to you. Regards

He meant because it's such a good price, and only 5 in stock, it'll obviously be sold out within minute(s)

This discussion isn't worth continuing regards

And by "minutes" he meant "seconds". Regards.

plumberg · December 2024

Regards
Reguards
Re guards
Re guar ds

Wich it s corect?

emgh · December 2024

@plumberg said:
Regards
Reguards
Re guards
Re guar ds

Wich it s corect?

reg

stackr · December 2024

@donli said:

@emgh said:

@tansel said:

@JohnFilch123 said:

@tansel said:

Am I causing you any trouble by asking if there’s still stock available?

Nope but the answer is kinda obvious.

I don’t think there’s anything wrong with inquiring about the stock. Did I violate any forum rules? If so, I apologize to you. Regards

He meant because it's such a good price, and only 5 in stock, it'll obviously be sold out within minute(s)

This discussion isn't worth continuing regards

And by "minutes" he meant "seconds". Regards.

Every out of stock deal saves at least $7 . Regards

emgh · December 2024

@stackr said:

@donli said:

@emgh said:

@tansel said:

@JohnFilch123 said:

@tansel said:

Am I causing you any trouble by asking if there’s still stock available?

Nope but the answer is kinda obvious.

I don’t think there’s anything wrong with inquiring about the stock. Did I violate any forum rules? If so, I apologize to you. Regards

He meant because it's such a good price, and only 5 in stock, it'll obviously be sold out within minute(s)

This discussion isn't worth continuing regards

And by "minutes" he meant "seconds". Regards.

Every out of stock deal saves at least $7 . Regards

onidel saved you just a little over 3 bux reg

emgh · December 2024

reg my i m hard

Howdy, Stranger!

Categories

In this Discussion

[2024 EXTENDED] Black Friday / Cyber Monday: FLASH SALE & MEGATHREAD

Comments

You have been heard!

You have been heard!

You have been heard!

You have been heard!

You have been heard!

Howdy, Stranger!

Quick Links

Categories

In this Discussion

[2024 EXTENDED] Black Friday / Cyber Monday: FLASH SALE & MEGATHREAD

Comments

You have been heard!

You have been heard!

You have been heard!

You have been heard!

You have been heard!