Putting Large Models
into a Small Box
AIBOX-1684X AI Box
Up to 32 TOPS of computing power! The AI box supports the private
deployment of mainstream large models,
bringing private AI capability to meet individual AI deployment needs.

Private deployment of complete large language models

32 TOPS BM1684X AI processor
with ultra-high computing power

BM1684X, the SOPHON AI processor, features an octa-core ARM Cortex-A53 running at up to 2.3 GHz and is built on a 12nm lithography process.
With up to 32 TOPS (INT8), 16 TFLOPS (FP16/BF16), or 2 TFLOPS (FP32) high-precision computing power,
it supports mainstream programming frameworks and can be widely used in artificial intelligence inference for cloud and edge applications.

Powerful multi-channel video AI performance

The AI box supports up to 32 channels of 1080p H.264/H.265 video decoding and 32 channels of 1080p HD video processing (decoding +
AI analysis), making it ideal for various AI applications such as face detection and license plate recognition on video streams.
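As a rough capacity check, the quoted 32-channel 1080p@25fps decode figure can be treated as an aggregate pixel-rate budget. The sketch below is illustrative only: the helper name is made up, and real limits also depend on codec, bitrate, and the AI workload per stream.

```python
# Illustrative capacity estimate based on the rated 32 x 1080p@25fps
# decode budget. A linear pixels-per-second model is an assumption,
# not a vendor-documented sizing method.
BUDGET_PIXRATE = 32 * 1920 * 1080 * 25  # aggregate decoded pixels/second

def max_streams(width: int, height: int, fps: int) -> int:
    """Estimate how many streams of the given format fit in the budget."""
    return BUDGET_PIXRATE // (width * height * fps)

print(max_streams(1920, 1080, 25))  # 32 (the rated figure)
print(max_streams(1280, 720, 30))   # 60 (720p30 streams, by the same model)
```

Under this simple model, lower-resolution streams trade off against channel count; measured throughput on real hardware should always take precedence.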

Strong network communication capability

With dual 1000Mbps Ethernet ports, the AI box ensures high-speed, stable network communication,
meeting the needs of various application scenarios.

All-aluminum alloy enclosure for heat dissipation

The industrial-grade all-metal enclosure uses an aluminum alloy structure for thermal conduction. The sides of the top cover feature a
grille design for external airflow and efficient heat dissipation, ensuring computing performance and stability even under
high-temperature operating conditions. The top cover itself uses a porous hexagonal design, combining elegance with high efficiency.
The compact, exquisite device operates stably and meets the needs of various industrial-grade applications.

Abundant resources

We offer SDKs, tutorials, technical documentation, and development tools to streamline and improve the development process.

A wide range of applications

The device is widely used in intelligent surveillance, AI education, services based on computing power, edge computing,
private deployment of large models, and data security and privacy protection.



Basic Specifications

SoC: High-performance octa-core ARM Cortex-A53, 12nm lithography process, frequency up to 2.3 GHz


TPU: Built-in tensor computing module (TPU), computing power up to 32 TOPS (INT8), 16 TFLOPS (FP16/BF16), 2 TFLOPS (FP32)


Video: 32-channel H.265/H.264 1080p@25fps video decoding; 32-channel 1080p@25fps HD video processing (decoding + AI analysis); 12-channel H.265/H.264 1080p@25fps video encoding




Storage: 32GB/64GB/128GB eMMC, 1*TF card slot


Power: DC 12V/4A (DC 5.5*2.5mm)




AI support:

・ Private deployment of ultra-large-scale parameter models under the Transformer architecture,
including large language models such as LLaMa2, ChatGLM, and Qwen, as well as large vision models like ViT,
Grounding DINO, and SAM

・ Private deployment of the Stable Diffusion V1.5 image generation model in the AIGC field

・ Traditional network architectures such as CNN, RNN, and LSTM; a variety of deep learning frameworks,
including TensorFlow, PyTorch, MXNet, PaddlePaddle, ONNX, and Darknet, as well as custom operator development

・ Docker container management technology
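Since the box supports Docker container management, workloads can ship as containers. Below is a minimal deployment sketch: the image name, service port, and device node path are illustrative assumptions, not vendor-documented values.

```shell
# Hypothetical inference-service container; the device node path is an
# assumption -- check your driver installation for the actual name.
docker run -d \
  --name ai-service \
  --restart unless-stopped \
  -p 8080:8080 \
  --device /dev/bm-sophon0 \
  example-registry/ai-service:latest
```

Passing the accelerator through with `--device` and using `--restart unless-stopped` keeps the service available across reboots, which suits unattended edge deployments.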


Dimensions: 90.6mm * 84.4mm * 48.5mm


Weight: ≈ 420g


Environment: operating temperature -20℃~60℃, storage temperature -20℃~70℃, humidity 10%~90% RH (non-condensing)

Interfaces

Ethernet: 2*1000Mbps Ethernet ports



USB: 2*USB 3.0 (current limit 1A)


Others: 1*power button, 1*Type-C (debug serial)